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Abstract 

The buying and selling of information is taking place at a scale unprecedented in the history of 
commerce, thanks to the formation of online marketplaces for user data. Data providing agencies 
sell user information to advertisers to allow them to match ads to viewers more effectively. In 
this paper we study the design of optimal mechanisms for a monopolistic data provider to sell 
information to a buyer, in a model where both parties have (possibly correlated) private signals 
about a state of the world, and the buyer uses information learned from the seller, along with 
his own signal, to choose an action (e.g., displaying an ad) whose payoff depends on the state of 
the world. 

We provide sufficient conditions under which there is a simple one-round protocol (i.e. a 
protocol where the buyer and seller each sends a single message, and there is a single money 
transfer) achieving optimal revenue. In these cases we present a polynomial-time algorithm 
that computes the optimal mechanism. Intriguingly, we show that multiple rounds of partial 
information disclosure (interleaved by payment to the seller) are sometimes necessary to achieve 
optimal revenue if the buyer is allowed to abort his interaction with the seller prematurely. We 
also prove some negative results about the inability of simple mechanisms for selling information 
to approximate more complicated ones in the worst case. 



*This work was done while the author was an intern in Microsoft Research SVC. He is supported by a Microsoft 
Research Fellowship. 



1 Introduction 



A growing trend in online advertisement is the usage of behavioral targeting and user informa- 
tion (like demographics) to better match advertisements to the viewer. This is possible due to 
the existence of data providing agencies, like Bluekai, Bluecava, eXelate Media, Clearsprings and 
RapLeaf, whose business consists in collecting, curating and selling information about user intent to 
advertisers. An article in NYT [8] analyzes this phenomenon and points out that data agencies are 
not exclusively an Internet phenomenon. For example, for many years companies like Acxiom and 
Experian (founded in 1969 and 1980 respectively) have been collecting information about consumer 
habits and selling this information to marketers, who then can use it to send catalogs by mail. 

A concrete situation is as follows: an advertiser has multiple different ads that he can present to 
the viewer, and the effectiveness of each of them depends on both the ad and the characteristics of 
the viewer. For example, a car maker would rather show sport car advertisement to affluent young 
bachelors while showing ads for family cars to older viewers with kids. A data providing agency 
(the seller), might have some information about the viewer generating the impression, like gender, 
age and past interaction on that site. Such information could be valuable to the advertiser as he 
would be able to use it for better targeting, and the monopolist seller would like to extract as much 
as possible out of this value as revenue. The advertiser (buyer) might have some information about 
the viewer as well, and this information might possibly be correlated with the seller's information. 
The seller has uncertainty about the buyer's information or utilities, yet possesses some belief about 
those. 

While selling information about viewers raises obvious privacy questions, it also raises fascinating 
questions of a purely economic nature. How does one quantify the value of this information? What 
is the optimal (i.e. revenue-maximizing) selling strategy for information? What are the qualitative 
differences between selling information and selling physical goods and services? How do these 
differences influence the design of markets for information, and the algorithmic problems underlying 
such markets? 

To highlight the issues inherent in such questions, it is helpful to highlight some differences 
between a seller offering n distinct goods for sale and a seller offering n bits of information. 

(1) A seller of goods can group them into bundles, offering a subset of the goods at a specified 
price. A seller of bits can do many other things: for example, sh^ can set a specified price for 
revealing the Boolean XOR of the first two bits or some more complex function of the bits. 

(2) A consumer of goods generally knows their value even before they are allocated. The value of 
a piece of information is typically not known until the information is revealed. 

(3) By the taxation principle, a buyer of goods can be assumed, without loss of generality, to be 
facing a posted-price for each bundle (that is independent of his type). A seller of information 
may, in some cases, be able to extract strictly more revenue using an interactive protocol! rather 
than posted pricing. (See Section O) 

To be sure, there are some cases in which trading goods possesses some of the characteristics of 
trading information noted above. For example, a customer in a restaurant does not necessarily know 
the quality of the food he is about to consume; in turn, this can lead to sellers using interactive 
protocols, for example, allowing the restaurant customer to try a limited sample of food for a 
reduced price (or even for free) before deciding whether to order more. We interpret such situations 
as markets in which information and goods are coupled together, i.e. revealing the quality of the 
food occurs in conjunction with selling the food itself. 

^Throughout this paper, we use female pronouns for the seller and male pronouns for the buyer. 
^In this paper we will use the terms protocol and mechanism interchangeably. 
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This paper addresses some of these questions raised above, by situating them in a model that 
ehminates extraneous features — such as couphng of goods and information, or competition between 
multiple buyers and sellers of information — while attempting to remain quite general in the model's 
assumptions about information and its utility. Our only such assumption is that the utility of 
information lies in guiding future actions of the party receiving the information. Thus, in our 
model there is a single seller and single buyer. A state of the world (denoted by oj henceforth) is 
known to the seller but not the buyeiH, and the buyer's payoff type (denoted by 9) is known to 
the buyer but not the seller. The two parties engage in an interactive protocol, consisting of one 
or more rounds in which signals and/or money are exchanged. After this interaction, the buyer 
chooses an action (based on his posterior beliefs) and receives a payoff that depends on the state 
of the world, his own payoff type, and the chosen action. 

Crucially, we assume that the seller designs the protocol and can be trusted to faithfully follow 
the protocol he designs. The buyer, on the other hand, need not be honest: he may send signals 
that are inconsistent with his true payoff type if it is rational to do so. We do, however, distinguish 
between committed buyers — who can be committed to complete the specified protocol even if 
they are sending dishonest signals — and uncommitted buyers, who may abort the protocol if it is 
rational to do so, for example when they have received information and not yet paid for it. 

Our results. The set of all interactive protocols is a large and ill-structured space. Searching 
for the revenue-maximizing one is unfathomably complex unless there is a way to limit the search 
space. Our first set of results provide the tools necessary for that. In mechanism design this is often 
done by invoking some form of the revelation principle, due to Gibbard [13] and Myerson [16]. In 
their setting, buyers have private types and the seller (mechanism designer) needs to choose among 
a set of outcomes and can charge payments from the buyers. The revelation principle states that if 
a certain outcome and payments can be implemented in equilibrium of a possibly complicated and 
interactive mechanism, then it can be implemented in a simple direct revelation mechanism, where 
buyers report their types, and the mechanism chooses an outcome and payments. Moreover, this 
mechanism has a simple equilibrium where each buyer reports his type truthfully. If the outcome 
is the allocation of traditional goods, the revelation principle implies that the mechanism can be 
implemented as a protocol consisting of three steps: (i) buyers report their type, (ii) payments take 
place, (iii) outcome is determined. We say that such a mechanism has the one-round revelation 
property, a property which we define precisely later (in Definition 12. 7p . but intuitively it means that 
the buyers move only once (by declaring their type), payments happen only once and the sellers 
move only once (by choosing the outcome). 

Now, consider the case where instead of an allocation of traditional goods the outcome is the 
disclosure of information. Myerson's revelation principle still holds in the sense that any outcome 
can be implemented by a mechanism where buyers truthfully report their type in the first step. It 
is not clear, however, if the stronger property of one-round revelation still holds. After the buyer 
report his type, a sequence of payments and partial information disclosures might be required in 
order to implement a certain outcome. 

Our first set of results (Theorems 13.11 and 14. ip provide conditions under which the one-round 
revelation property holds. Their precise theorem statement, to be given in Sections [3] and 31 are a bit 
stronger: they supplement the revelation principle with additional information about the relative 
timing of signals and payments. 

Theorem 1.1 When buyers are committed, or when buyers are uncommitted but u and 9 are 

■^Our model also incorporates cases in which the buyer and seller each receive (possibly different) signals about 
the state of the world. In such cases, we simply treat the buyer's signal as being encoded in her payoff type, 6. 
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independent random variables, any mechanism can be transformed into a mechanism that extracts 
the same revenue and has the following form: the buyer and seller each sends a single message, 
payment takes place only once, the buyer's message is simply an announcement of his type, and 
truthful reporting maximizes the buyer's utility. 

Interestingly, the revelation principle fails in the remaining case, when there are uncommitted 
buyers and correlated signals. The usual logic justifying the revelation principle — that the agents 
can always report their types to the mechanism and let it simulate their optimal strategy given their 
type — does not apply for a subtle reason having to do with the timing of payments, the correlation 
of the signals, and the fact that the buyer is uncommitted. The direct mechanism that attempts 
to simulate an interactive protocol cannot determine an unbiased estimate of the buyer's expected 
payment before observing w, because, unlike in the independent case, the conditional distribution of 
uj depends on the value of 6 (the buyer's true type) and not necessarily on the type that is reported. 
On the other hand, if the mechanism simulates the protocol using the realization of uj and posts a 
price that depends on the simulation outcome, this fails because the buyers are uncommitted: the 
price reveals information about uj, and the buyers may take this information for free while refusing 
to pay. 

Our next results concern algorithms for computing the optimal mechanism. Even when the 
one-round revelation property holds, it is far from obvious how to compute the optimal mechanism 
efficiently. In a one-round revelation mechanism, the seller allocates information to the buyer 
by revealing a (possibly random) signal sampled from a distribution that depends on uj and 9. 
The main difficulty is that seller is free to choose the support size of this distribution (i.e., the 
number of potential signals) and in principle, this leads to an optimization problem of unbounded 
dimensionality. Nevertheless, we show that the optimal mechanism can be computed in polynomial 
time by solving a convex program of bounded dimensionality; a by-product of the proof is an explicit 
upper bound on the number of potential signals. 

Theorem 1.2 Suppose that uj can take only m possible values and can take only n possible 
values. When buyers are committed, or when buyers are uncommitted but uj and 9 are independent 
random variables, there is an algorithm that computes the optimal mechanism in time poly(m,n). 
Furthermore, there is an optimal mechanism in which every signal transmitted from the seller to a 
buyer is sampled from a set of size 0{m + n). 

In Theorem 14.21 we prove an analogue of the result of Cremer and McLean [10] on optimal 
auctions with correlated bids. We show that when the correlation of uj and 9 is complex enough 
that a certain matrix has full rank, the optimal mechanism extracts the full surplus. However, 
as in [To], when this matrix is ill-conditioned the optimal mechanism can be quite exotic, using a 
mixture of unboundedly large positive and negative payments. This raises the following question: 
To what extent can its revenue be approximated by simpler and more natural mechanisms? We 
explore this question in Section 14.21 by investigating the relative power of four progressively more 
general types of mechanisms: 

(i) a "sealed envelope" mechanism that treats u as an indivisible good by writing its value inside 
a sealed envelope and posts a price for the the envelope; 

(ii) mechanisms that reveal a signal about uj but charge the buyer for this signal before revealing 
it; 

(iii) mechanisms that reveal a signal about uj and then charge the buyer a non-negative amount 
that depends on the signal; 

(iv) arbitrary mechanisms. 
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It is not hard to show that if one compares the optimal mechanisms from two of these four classes, 
their revenue never differs by more a factor of more than |G|, the number of potential buyer types. 
Section 14.21 shows that this multiplicative gap is tight up to a constant factor: for any two of the 
aforementioned classes of mechanisms, one can find examples where mechanisms in the more general 
class obtain times as much revenue as the optimal mechanism in the more specific class. 

Our work leaves many interesting open problems, we discuss these problems in Section [6] and 
point out potential connections to other areas in computer science (as cryptographic) and in eco- 
nomics (as cheap talk and dynamic mechanism design). 

Related w^ork. The concept of information occupies a notable position in Auction Theory. In the 
classic work of Milgrom and Weber [15] , the authors consider different auction formats for a single 
item and discuss how revenue changes as the seller reveals information (but not directly charging for 
it) regarding the quality of the good. Persico [18] remarks that the information structure is almost 
always assumed to be exogenous and out of the control of the mechanism designer. He initiates a 
line of inquire that proposed to endogenize the information acquisition process in the auction. The 
information acquisition in his model occurs by a competing firm paying some fixed exogenous cost 
(say by performing R&D). 

It is interesting to consider the qualitative changes when information moves from an auxiliary 
device to the position of the central object being sold. It was noted by many authors that classic 
results in economics that were designed for dealing with traditional goods fail when selling infor- 
mation. Varian [21] presents such discussion in a much broader context. Although expressing some 
similar concerns on the relation with traditional goods, his definition of information is very different 
from ours. 

Closer to our model are the work of Eso and Szentes [12] and Horner and Skrzypacz [TJ]. In 
Eso and Szentes [12] the authors develop a model of consulting, in which the consultant, sells 
information about a certain random variable to a client. The assumptions on the nature on the 
random variables are orthogonal to ours: theirs are continuous random variables on [0, 1] while ours 
are discrete random variables. Also, while our utility function is generic, their utility function is 
linear. The mechanism obtained resembles our Pricing Outcomes mechanism, where the payment of 
the client depends on the action he takes (in their paper, whether the client decided to undertake a 
project or not). Given that their utility function is simpler, the structure of the disclosed information 
is also a lot simpler. 

The paper of Horner and Skrzypacz |14j is close to our model of uncommitted buyers. Their 
main goal is to design self-enforcing contracts for an agent to sell a binary High-Low signal about 
the state of the world to a firm. The authors remark: "Lack of commitment creates a hold-up 
problem: since the Agent is selling information:, once the Firm learns it, it has no reason to pay 
for it". The model again is orthogonal to ours, since the type of the Firm (information buyer) is 
known - so there is no uncertainty from the sellers perspective. On the other hand, no side has 
the ability to commit. This ability in our model leads us to a mechanism design approach to the 
problem, while the authors analyze the set of equilibria of a game very much in the spirit of the 
cheap talk literature [5]. 

Our model is also related the treatment of information in Athey and Levin [3]. The authors 
model a decision maker who chooses an action and gets a reward depending on both the action 
chosen and an unknown state of the world from which the decision maker gets an imperfect signal. 
They assume a total ordering on the space of states of the world and on the space of signals and 
restrict their attention to monotone decision processes. We, on the other hand, do not make any 
structural assumptions about the set of possible states of the world and set of possible signals 
(except being finite) and also do not assume anything about the decision problem. Our approach 
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is also different: while the authors in [5] derive comparative statics for the demand of information, 
we take a mechanism design approach. 

Also related to our paper is the work of Admati and Pfleiderer [H [2]. Motivated by financial 
advice in the market for securities, the authors analyze the following setting: a monopolist seller 
has information about how much a share of a risky asset pays off, which is distributed according to 
a normal random variable. They consider a two step process: in the first round, the seller is able 
to sell this information to a continuum of traders using some mechanism. In the second round, the 
buyers (traders) trade in a speculative market. The authors consider the problem of how to design 
mechanisms to maximize revenue. Although of a similar spirit, the model of Admati and Pfleiderer 
is orthogonal to ours. Their model considers multiple buyers that trade in the same market in 
the second round, instead of simply making a decision. On the other hand, many features of our 
model are absent in theirs: they do not consider uncertainty of the seller about the buyer's utility 
function and the fact that the buyers might already have some private information correlated with 
the seller's information. Moreover, they also limit the seller's power to transform the signal. In 
their model, the only transformation the seller is able to perform on the signal is to add normally 
distributed noise to it. 

Our notion of uncommitted buyers is in the spirit of participation constraints used in the litera- 
ture on dynamic mechanism design [3 [4]. This literature considers multi-round interaction between 
a mechanism and the agents, and for such mechanisms it is required that individual participation 
constraints hold in every period. In parallel, when considering uncommitted buyers we require that 
participation is voluntary at every point the buyer plays, that is, that the buyer does not defect 
throughout the protocol. Note that unlike that literature, our buyer only gets one exogenous signal, 
so the different "periods" are not points of getting new exogenous information, but rather points 
where he takes an action in a multi-round protocol with the seller. 

2 Setting: Context and Protocols 

We consider a setting with a buyer and a monopolist seller. The buyer is a decision maker and his 
decision can be represented as picking an action a £ A. His reward from this action depends on the 
state of the world which is unobservable. However, both buyer and seller get private signals about 
the state of the world. Let 6 € Q he the private signal of the buyer and cj S be the private signal 
of the seller. In this paper we consider the case that both Q and f2 are finite. The buyer's expected 
reward for taking action a when the two signals are realized to ^ a; is given by u{d,u!, a). The pair 
of private signals comes from a joint distribution G A{U x 0)o We denote the prior on to by the 
vector p E M^, i.e., p{uj) = fi{uj). We cah the pair {u, fj.) the context and assume it is common 
knowledge. 

To illustrate the model, suppose the buyer is an Internet advertiser who has acquired one 
display-ad slot and is deciding which ad to show to the user. So, the set A represents the possible 
ads he can place on this slot. The effectiveness of each ad depends on who the user is exactly. 
This is unknown to the buyer, but he has a private signal 9, which is the user browsing history in 
the website. Consider now the seller as a data provider who has information about age, gender, 
geographic location and income range of the user. Let this information be encapsulated in a signal uj. 

Another interpretation is to consider 9 as the type of the buyer. Since the reward of the buyer 
u{0,uj,a) is a function of 9, one can use the same model to express the seller's uncertainty about 

*As usual, given any set X we represent the set of probability distributions over X by A(X). We denote by 
d) the probability of the event {oj, 9) and by A'(cj) = X^s A'('^i ^) ^iid = ^) the marginals of u} and 6 

respectively. 
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the buyer's reward function. 

The information that the seller holds is valued by the buyer. If the buyer observe his signal 
and nothing more, his expected reward is max^ Etj[n(0, w, a)\6], where the expectation is taken over 
uj sampled from ii{-\0). If he also learns the value of uj exactly, his expected reward increases to 
E(^[maXa w, a)|0]. His surplus from knowing uj is thus: 

£^{9) = E(^[maxa u{9, u, a)\6] — max^ Ei^[u{6, w, 0)16*] 

Ideally, the seller would like to extract this extra surplus as revenue, but as she faces uncertainty 
regarding the buyer (she does not know 9) and since the buyer act strategically, generically the 
seller would not be able to extract all that surplus. The central question of this paper is, "What 
mechanisms can the seller use in order to extract the largest possible fraction of this surplus?" 

2.1 Sealed Envelope Mechanism 

Before we start exploring the space of all possible mechanisms, we present a very simple (and 
usually suboptimal) mechanism — the Sealed Envelope Mechanism. We do so in order to highlight 
some basic difficulties in designing mechanisms for selling information. In this mechanism the seller 
treats the information as if it were a regular good. She writes w on a piece of paper, puts it inside 
an envelope and then offers the envelope for a fixed price t to the buyer. If the buyer's type is 
9, then his value for the envelope equals to £,{9), his surplus from knowing oj, so the revenue is 
t ■ ¥g^^[(^{9) > t]. Note that the seller is not using her knowledge of uj in determining t. It is easy 
to optimize t to obtain the best Sealed Envelope Mechanism. 

Notice that, after seeing uj, the seller can update her belief about 9. The seller might try to 
change the mechanism in the following way: enclose uj in an envelope and sell it for price t{uj) 
maximizing t ■ IP0~/j,[C(^) ^ ^l*^]- By doing so, the seller leaks information in the prices. Upon 
observing price t{u}), the buyer gains information about uj even without buying the envelope. 

2.2 Generic Interactive Protocol 

Our first step is to define a generic interactive protocol between the buyer and the seller. We assume 
that the protocol is designed by the seller with only knowledge of the context (u, The protocol 
prescribes the behavior of the seller for each uj and we assume that the seller always follows this 
prescription. All of this is known to the buyer. 

After this we state Myerson's revelation principle for this setting and then we formally define 
the stronger notion of one-round revelation. Then we discuss to what extend it is possible to obtain 
one-round revelation mechanisms for this setting. 

Definition 2.1 (Generic interactive protocol) A generic interactive protocol for a context [u, ^i) 
is a finite decision tree defined on a set of nodes N . For each non-leaf node, let C{n) he the children 
of node n. Each non-leaf node is labeled either as a seller-node, as buyer-node or a transfer-node. 
Furthermore: 

• each seller node n has a prescription of the seller behavior, which associates for each uj a 
probability distribution over C{n). Formally, the prescription on node n is a collection of 
distributions ip'^ £ A(C(n)), one for each uj. 

• each transfer node n has only one child and has associated with it a fixed (possibly negative) 
amount t[n). 
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In practice we can think of each edge as labeled with a different message. Starting from the root, 
if the seller or buyer moves, she or he sends a message, which is represented by moving down the 
tree (picking a child). As the seller pledges to follow the protocol, her behavior (distribution of 
messages she sends conditional on uj) is encoded in seller nodes in advance. On the other hand, 
the buyer strategically decides on the messages he sends at buyer nodes given the protocol tree 
defined by the seller. Moving down the tree from a transfer node n to its child represents a money 
transfer; the value t{n) designates how much is transferred from buyer to seller, so a negative value 
represents a payment to the buyer. No action is taken at leaf nodes. In each such a node the buyer 
will update his belief about lo based on the protocol history. This belief is called the posterior 
probability distribution. 

We consider two types of strategies for the buyer: committed and uncommitted strategies. A 
committed strategy is one where we can trust the buyer to follow the entire protocol, i.e., when 
reaching a buyer node, he sends one of the messages specified by the protocol and when reaching 
a transfer node, he sends or receives the specified amount of money. An uncommitted strategy is 
one where the buyer has, on top of that, the option of defecting from the protocol in each node, 
by simply leaving the mechanism (which is formally captured by allowing him to play " _L" ) . More 
formally: 

Definition 2.2 (Buyer strategies) A committed strategy is a collection of distributions G 
A(C(n)) for each buyer-node or transfer-node n and each type 6. 

An uncommitted strategy is a collection of distributions (p^ G A(C(n)U{_L}) for each buyer- 
node or transfer-node n and each type 6. 

At this point, it is instructive to represent the Sealed Envelope Mechanism in the form of a generic 
protocol. It is a tree consisting of three interior nodes: in the root there is a buyer-node with two 
children corresponding to the messages: "Do not accept the offer" and "Accept the offer". The 
child corresponding to "Do not accept the offer" is a leaf. The other child is a transfer-node with 
the specified amount of t. Its only child is a seller-node. This seller node has a child for each w G $7 
and the seller prescription for this node is simply — 1- See Figured] for an illustration. 

Two other natural selling strategies are important in this work. Pricing Mappings refers to 
any posted-price mechanism in which the seller presents a menu of offers each having the following 
form: for a fixed amount of money, the buyer obtains the right to observe a random signal sampled 
by the seller from a distribution that depends on the value of in a pre-specified way. Pricing 
Outcomes refers to a similar type of posted-price mechanism with one crucial difference: rather 
than charging the buyer a fixed amount of money after he selects an offer from the menu, the 
amount that the buyer pays (or receives) is allowed to depend on the signal that is revealed by the 
seller. This gives the seller the potential to price-discriminate among buyers whose different types 
lead to their having different beliefs about the conditional distribution of w, and therefore different 
assessments about the expected cost of accepting a given offer. Mechanisms that price mappings or 
price outputs can easily be represented in the form of generic interactive protocols, as illustrated 
in Figured! 

It is important to notice that the seller designs the protocol solely based on the context, and 
after she designs the protocol its description becomes common knowledge. This happens before the 
pair {0,uj) is drawn. For example, the price at which the item is offered in the Sealed Envelope 
Mechanism is hard-coded in the protocol, so there is no need for the seller to send a message 
announcing it. 

Now we define the utility associated with a committed strategy for a given protocol. Each 
committed strategy (j) induces a distribution over the leaves of the tree: sample {9,uj) ~ /x, then 
start in the root and use ■0n and 4)^^ to move down the tree until a leaf is reached. Let Z be the leaf 
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Figure 1: Illustration of various protocols. A node labeled by 6 is a buyer node, a node labeled by 
■s is a seller node, and a node labeled by t is a transfer node. 

reached. For each leaf i associate r(£) the sum of the amounts of the transfer-nodes in the path 
between £ and the root. We define the utility of buyer of type 6 for (j) as: 

U{e,(l)) =E^[maXaE[u{e,LO,a)\e,Z]-T{Z)\e] 

We say that a protocol is voluntary is there is a committed strategy <j) such that 

U{6,())) > maxE^[u{6,io,a)\e], V^. 

a 

This means that the expected utility the buyer gets from participating in the protocol is at least as 
large as the utility he would get by not participating in it. A committed strategy is called optimal if 
for all 6 and for all alternative committed strategies <j)', U{9, <j)) > U (9, (j)'). The revenue extracted 
by this protocol is: E[r(Z)]. 

We can define similar concepts for uncommitted strategies: for a given node n, let T(n) be the 
sum of the amounts in the transfer nodes in the path between the root and n (not including n). An 
uncommitted strategy defines a distribution Z over the nodes of the tree (not necessarily leaves) 
and we can therefore define Z, U{9, (p), optimal strategy and revenue in the exact same way. Notice 
that every protocol is trivially voluntary for uncommitted buyers, since there is always a strategy 
guaranteeing the buyer max^ E,i_j[u{9, uj, a)\9], which is the strategy that defects at any transfer node. 

Definition 2.3 We say that it is possible to extract revenue R from a committed (uncommitted) 
buyer in a context (n, fj,) if there is a voluntary protocol for this context and an optimal committed 
(uncommitted) strategy (j) for this protocol with revenue at least R. 

Notice that in the case of committed buyers 'voluntary' is an important restriction, because 
otherwise, one could simply have a mechanism consisting solely of a transfer node of amount R and 
a leaf. For uncommitted buyers, however, the only optimal strategy in such a mechanisms would 
be to defect in the root. 

Definition 2.4 We define the optimal revenue that can be extracted from a committed (uncommit- 
ted) buyer in a context {u, fi) to be the maximum R such that for any e > it is possible to extract 
revenue R — e from a committed (uncommitted) buyer in a context {u^fx). 



2.3 Revelation Principle and One-Round Revelation Mechanisms 

First we define the concept of a revelation mechanism^ in the sense of Gibbard [13] and Myerson [16) 
and recast their celebrated revelation principle in our setting (its proof is included, for complete- 
ness, in Appendix [A]) . For our purposes it would be sufficient to formulate it in terms of revenue 
(traditional formulations are somewhat more general). 
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Definition 2.5 (Revelation Mechanism) A revelation mechanism is a protocol represented by 
a tree where the root is buyer node, where the buyer is asked to report his type. Moreover, there are 
no other buyer nodes in the tree. A strategy in such a protocol is truthful if the buyer reports his 
type truthfully in the root. 

Theorem 2.6 (Revelation Principle) Consider any context {u,^i). If it is possible to extract 
revenue R from a committed (uncommitted) buyer in the context {u,^), then there is a revelation 
mechanism and a committed (uncommitted) truthful strategy that is optimal for it and produces 
revenue R. 

Theorem 12.61 says that we can restrict our attention to Revelation Mechanisms. However, this 
still allows trees with arbitrary depths and complicated arrangements of seller and transfer nodes. 
Next we define the stronger notion of One-round revelation mechanisms., where only a very simple 
interaction between the seller and the buyer is allowed: 

Definition 2.7 (One-round Revelation Mechanism) A one-round revelation mechanism is a 
revelation mechanism represented by a tree of depth three where each path from the root has at most 
one vertex of each type (i.e. at most one seller node and at most one transfer node). 

Now we present formal definitions of two special types of One-round Revelation Mechanisms 
which were briefly discussed earlier in this section: 

Definition 2.8 (Pricing Mappings and Pricing Outcomes) A Pricing Mappings Mechanism 
is a truthful direct revelation mechanism in which all the children of the root are transfer nodes and 
all their children are seller nodes. A Pricing Outcomes Mechanism is a truthful direct revelation 
mechanism in which all the children of the root are seller nodes and all their children are transfer 
nodes. 

3 Independent signals 

In this section we analyze the case when uj and 9 are independent, which is simpler than the 
general case. In this case, the seller's belief about 6 and the buyer's belief about u are common 
knowledge. First we prove that for this setting, we can focus on One-round Revelation Mechanism 
when searching for the optimal mechanism. Then we show how to compute it efficiently using a 
convex program. Finally we show that there always exists a protocol with a fairly small tree. 

Theorem 3.1 (Existence of a One-Round Optimal Mechanism) Consider any context [u, fi) 
such that 6 and lo are independent. If it is possible to extract revenue R from a committed buyer in 
the context {u,n), then it is possible to extract revenue R from an uncommitted buyer in the same 
context using a Pricing Mappings Mechanism. 

The proof is presented in Appendix [Bj A consequence of Theorem 13. II is that with independent 
signals the fact that the buyer is committed does not help the seller to extract more revenue. In 
this setting it is possible to extract revenue R from uncommitted buyers if and only if it is possible 
to extract revenue R from committed buyers. 

Pricing Mappings Mechanism. 

Theorem 13.11 allow us to focus on Pricing Mappings Mechanisms. An alternative way of describing 
a Pricing Mappings Mechanism is as a fixed menu of contracts {(^e, te)}eGe- The contract (19,^0) 
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is intended for a buyer of type (in the sense that it would be optimal for such a buyer to choose 
that contract out of the menu). A buyer choosing the contract {Yg^tg) would pay tg and observe 
one realization of the random variable Yg that is correlated with w, taking values in a finite set Sg. 
We call the elements of Sg signals, since they reveal to the buyer some information about u. By 
buying the contract {Ygi,tgi) a buyer of type gets utilitjH Ey^, [max^ Etj[ti(0, w, a)!!^/]] — tgi. Now, 
we discuss how to design the menu of contracts in order to maximize revenue. 

To be more precise, Yg is a random variable that is produced by the seller using uj and possibly 
some random bits r that are independent of (w, 9). Without loss of generality we represent 1^ by a 
family {^g jo^en for Tpg G A.{Sg). In order to sample Yg, the seller observes uj and then samples the 
value of Yg according to . 

Without loss of generality, we can call {Yg,tg) the favorite contract of a buyer of type 9. In 
order for such set of contracts to be valid we need to make sure that: (1) the protocol is voluntary, 
i.e., the utility of a buyer of type 9 by taking contract {Yg, tg) is at least as high as his utility of not 
participating in the mechanism and acting using his belief given only 6, and (2) contract {Yg,tg) is 
indeed his favorite one, i.e., he would not strictly prefer to misreport his type and buy a contract 
{Ygi,tgi) for some 9' ^ 6. Property (1) ensures individual rationality (IR) and property (2) 
ensures incentive compatibility (IC). Formally: 

Definition 3.2 A menu of contracts {{Yg,tg)}g^Q) is valid if and only if: 

Eyg[maXaE^[n(6',w,a)|y6i]] - > maXaEj^['u(6',t^,a)], V6', {IRe) 
Ey,[max<,E^[n(0,a;,a)|ye]] -te> Ey^, [max„ E^[tx(e, t^, a)|ye']] - tg,, \f9 / 9', {ICg^,) 

Given a valid menu of contracts, its associated revenue is given by X^gg© fi{9)-tg. This definition 
implicitly assumes that whenever the buyer of type 9 is indifferent between contract {Yg,tg) and 
not buying anything, i.e. the IR constraint is tight, then he buys contract {Yg,tg). It also assumes 
that whenever he is indifferent between {Yg,tg) and {Ygi,tg'), he buys (Yg,tg). This assumption is 
without loss of generality, since given any menu of contracts {{Yg,tg)}g^Q with revenue R, for every 
e > it is possible to produce a menu {{Yg,t'Q)}g^Q with revenue {1 — e) ■ R such that all IR and 
IC inequalities hold strictly. We defer the formal proof of this fact to Lemma IC.3I 

Our goal is, for any given context, to design the valid menu of contracts with largest possible 
associated revenue. We call it the optimal menu. 

Before starting to optimize the menu, consider a couple of definitions: if Yg is a variable taking 
values in a space Sg, then for each s £ Sg, the le-posterior associated with s is the distribution 
q £ A(0) such that q{uj) = P(a;|s). We define the value of buyer of type 9 for posterior q as: 

vg{q) = maXagA q{uj)u{9, uj, a) 

which is a piecewise-linear convex function vg : — t- M. Usually vg{q) is defined for q £ A ($7), but 
sometimes it is used for vectors q > such that Xlo; '?('^) ^- '^^^ reader should note, however 
that it is a homogeneous function: vg{\q) = Xvg{q),yX > 0. 

We next show that we can represent a signal by a distribution over a finite set of posteriors (Ob- 
servation [33]) without repetition (Observation 13.31) . The proofs of the observations are immediate, 
but we include in Appendix |B] for completeness. 

Observation 3.3 Given a menu {(Yg,tg)}g, if there are s,s' G Sg such that the Yg-posteriors 
associated with them are equal, consider the menu obtained by substituting Yg by Yg, where: Yg = s 

^Since this section talks about the case that uj and 6 are independent, sampling lj ~ fj.{- \ 6) is the same as sampling 
cj ~ ^, and thus in the rest of the section when writing Et^ we mean Etj^,i(.|e) 
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for Yg = s' and Yg = Yg otherwise. The obtained menu is also valid and the revenue associated with 
it is the same as the one of the original menu. 

Given that the posteriors associated with each signal s Sg are different, we can represent Yg 
by a distribution over a finite set of posteriors, i.e., a set Q of posteriors, each q G Q being of the 
form q G A(0), and a probabihty xg{q) of each posterior q £ Q. The condition that a distribution 
over posteriors represents a random variable correlated with uj is in the fohowing observation, whose 
proof is immediate. 

Observation 3.4 Let Y be a random variable correlated with u, i.e., given ip'^ G ^('5), S = [k], Y 
is sampled by first sampling w ~ ^ and then sampling from ip'^ . Consider a set Q = {qi, . . . , q^} C 
A(f2) and x G A((5). The pair {Q,x) represent a distribution over posteriors of a random variable 
Y correlated with uo if and only if: 

T.q,eQ ^(^i) ■ = IJ-i^) Va; G Q, (F) 

From this point on, we represent each Yg as a function xg : A(il) — t- [0, 1] with finite support, 
satisfying Equation (F) for each 9. 

T,q,eQ ^e{qi) ■ Qii^) = K^) Vo; G 0, (Fg) 

For any finite set of posteriors Q C A(r2), we can formulate a restricted revenue maximization 
problem — for mechanisms that offer a menu of contracts with posteriors restricted to belong to 
Q — as LPi below with variables xg{q) for each 6 € @, q G Q and tg for each G 0. Recall that 
p G A(il) is the prior on u, i.e., p{uj) = fi{uj). 

LPi : max s.t. 

YlqVe{q)xg{q) -tg> Vg{p), MO, {IRg) 

YlqVe{q)xg{q) -tg> Y.qMq)xe'{q) - tg', VO' / 6, {ICg^g') 

YlqXeiq) ■ qi^) = K^) ^e,uj, {Fg) 

xe{q),tg>0, M6,q 

The constraints in LPi correspond to the characterization of valid contracts in Definition [3?2] and 
the feasibility of the representation of contracts as distributions over posteriors in Observation 13.41 

To work with a linear program of finite size, we have restricted q to belong to a finite set Q. It 
is conceptually easier to think of q as ranging over the entire set A(r2), in which case LPi would 
represent the revenue maximization problem in full generality. The following lemma, whose proof is 
in the appendix, shows that the restriction to a finite set of posteriors is without loss of generality. 
There is a finite set Q* that can be precomputed from knowledge of the function u alone, such that 
solving LPi with Q = Q* '\s guaranteed to produce an optimal menu of contracts. 

Lemma 3.5 (Interesting Posteriors) Given u:0xf]xj4— )'R, there is a finite set Q* C A{Q,) 
such that for all fi, the maximum revenue that can be extracted by any protocol can also be extracted 
by a protocol that is limited to use posteriors in Q* . Moreover, all the elements q £ Q* can be 
represented with polynomially many bits. 

By passing to the dual of LPi, we get an LP with O(|0p) variables. Below, we give a separation 
oracle for the dual showing that it can be solved in polynomial time. The variables of the dual are 
gg,hgM' G M and ygGM.^: 
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DLPi 



minY^g p^yg s.t. 



9e + J2e'ytei^e,e' - he',e] > KG), V6i 
ge,he,e'>0,ye£R'', ^6,6' 



Separation oracle: The second family of constraints is of size |0|, so separating it is trivial. In 
order to separate the first family we re-write the constraints in a different way. Notice that the 
vq{-) is the maximum over |^| linear functions. So, we can substitute each constraint of the first 
family for the following |^| constraints: 



for all a & A, 9 & Q and q £ Q* . Now, for fixed a, 6 we want to check if this constraint is satisfied 
by all q € Q* or find one q for which the constraint is violated. 

Relaxing the requirement q £ Q* to q £ A(r2), this is equivalent to solving the following convex 
programming problem: 



Lemma 3.6 The convex programming problem above can be solved exactly in polynomial time. 
Given an optimal solution q G A(r2), there must exist another optimal solution q* that belongs to 
Q* , and we can find such a q* in polynomial time. 

The proof (given in the appendix) assumes that the set A is finite and polynomially bounded. 
But even if A has exponential size, if one is able to solve the problem max^ E(^^gw(w, 0, a) for any 
posterior q, we can still solve the problem. 

Comparison with Sealed Envelope. The sealed envelope mechanism presented in Section 12.11 
can extract at least 1/\Q\ revenue of the optimal mechanism, by the following simple observation: 
if a there is a voluntary protocol that extracts revenue R from a certain context (n, ^), then there 
is at least one 9 for which < fi{9) -^{9), where ^{9) = Ea;[maxa m(^, w, a)] — Mujlmaxa u{9 , lo , a)] 
is the maximum surplus that can be extracted from a buyer of type 9. By setting the price of the 
envelope to t = S^{9) — e for some tiny e > 0, the mechanism guarantees revenue at least n{9) • £,{9). 
In Example IB. II we show this bound is tight by presenting a context where the sealed envelope 
mechanism can not extract more then ^^(|^) of the optimal mechanism. 

Protocols with Small Trees. We just showed how to compute the revenue optimal protocol in 
polynomial time when 9 and uj are independent. We know that the protocol has polynomial size, 
where its size is measured by the number of nodes in the tree representing the protocol. Here, 
we make it more explicit and show that there is a protocol of size 0(|r2| • |0| + We show 

that by bounding the sum of the support size of the random variables in the menu of contracts. 
This bounds the number of leaves of the One-round Revelation Mechanism (Theorem 13. ip . As the 
number of nodes in the tree representing this protocol is at most 3 times the number of leaves 
(twice the number of leaves plus one node for the buyer and |0| seller nodes), the bound on the 
size of the tree follows from the bound on the number of leaves. The proof of the next theorem is 
in Appendix [BJ 

Theorem 3.7 Let n = |0| and m = Denote the support of a vector x by \\x\\q. The program 
LPi has a solution where WxqWq < m + n — 1 for all 9, and ll^^ello ^ mn-|- (2). Moreover, there 
are settings in which support of size quadratic in n is necessary, even when m = 2. 



Eeve ^e'{(l)he',e > -(fve + [ge + T.e'^e ^e,e') a)q{uj) 



mm 
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The fact that the quadratic lower bound holds even for m = 2 (Example IB.2p is somewhat 
counterintuitive: even when the information being sold is a single bit, there are contexts in which 
revenue maximization requires using signals that consist of f](logn) bits. 

4 Correlated signals with committed buyers 

In Section [3] we have considered independent signals and observed that for that case the seller 
does not care if the buyer is committed or not. In this section we consider correlated signals 
and committed buyers, the case of correlated signals and uncommitted buyers will be discussed in 
Section [5l Throughout this section we assume that the buyer is committed. 

4.1 Pricing Outcomes Mechanism 

In the previous section we showed that if 6,uj are independent, then the optimal protocol had the 
form of offering a menu of contracts with a fixed price for each contract. In this section we show 
that this is not sufficient to optimize the revenue whenever 6, oj are correlated. In order to optimize 
revenue, we need to add a twist: we still offer a menu of options, each option having a random 
variable Yg correlated with cj and taking values in Sg. Instead of a fixed price, however, we charge a 
specific price for each outcome s £ Sg of the signaling scheme. We continue to refer to the options 
on the menu as contracts, although this means that the word contracts has a slightly different 
meaning in this section than in the preceding one (as it is no longer the case that the buyer pays 
before observing any signal). 

Why does this construction help in designing mechanisms to optimize revenue? Suppose a 
seller designs a variable Y taking values in S. Consider ■ip'^ € A(5) for each oj such that the 
seller produces s € 5 by sampling s according to ip'^ . If oj and 6 were independent, the seller 
would be always choosing the same distribution from the buyer's perspective, which would be 
P(s) = X]^/Li(t<j) • F{ip^ = s). However, since 9 and w are correlated, different buyer-types perceive 
different distributions over S: a buyer of type 9 perceives P(s|0) = " IF'('0'^ = s)- So, if we 

condition the prices on the outcomes s, two different buyer- types see different prices for the same 
contract. This increases our power of price-discrimination. For the case of committed buyers we are 
able to show the existence of an optimal One-Round Revelation Mechanism. 

Theorem 4.1 (Existence of a One-Round Optimal Mechanism) For any context (n, /i), if 
it is possible to extract revenue R from a committed buyer in this context, then there is a Pricing 
Outcomes Mechanism that does so. 

The proof, which is given in Appendix[Cl is essentially the same as the proof of Theorem l3.ll except 
for the last step. Previously this step relied on the independence of 9 and oj; here, we instead rely 
on the fact that seller nodes are situated above transfer nodes in our protocol, which eliminates the 
need to estimate expected transfers over a random execution of the original protocol, and instead 
lets us match transfers pointwise. 

It should be noted that the fact that buyers are committed to follow the mechanism until the 
end is crucial. In fact, in any protocol containing a transfer node in which the buyer needs to pay 
the seller and whose child is a leaf, the optimal uncommitted strategy would be to defect at that 
transfer node. In other words, uncommitted buyers could acquire the information and leave without 
paying. We mention one way to solve this problem: before the mechanism starts, ask a large sum 
of money from the buyer. Run the mechanism and then gives the large sum of money back to the 
buyer. This will guarantee that the buyer follows the mechanism until the end, so as not to lose his 
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initial deposit. This will add an extra level to the mechanism: one transfer node in the beginning 
to charge this large sum of money. The rebate in the end can simply blend with the last transfer. 

Pricing Outcomes Mechanism. We will describe a Pricing Outcomes Mechanism using the 
following notation. The seller designs a menu of contracts solely based on the context. The menu 
is a collection {(ie,te)}ege where is a random variable correlated with uj and taking values in 
a finite set Sq^ just like in the independent case. The payment function, however, is a function 
: cSg — 7- R and it allows for both positive and negative payments. The seller outputs s ~ Yq, tg{s), 
which is a signal and a payment request of tg(s). If tg{s) < 0, the seller transfers Itgl-s)! to the 
buyer. 

In the following, we discuss how to find the menu maximizing revenue using a convex program. 
The derivation of this convex program closely parallels the derivation of the corresponding convex 
program in the independent case. We give the full details in Appendix[Cl and here we limit ourselves 
to discussing the two most salient differences between the derivation of the convex program in the 
independent and correlated cases. 

(1) The posterior vector after receiving a given signal depends upon the buyer's type. To compare 
posteriors across different types, we adopt a common "frame of reference" — that of an outside 
observer who observes the signal sent by the seller but does not observe 9 — and we translate 
from the type-^ reference frame to the outside observer's reference frame using a matrix Dg 
that expresses Bayes' rule. 

(2) Since payments are now associated with signals, the obvious way of expressing the expected 
revenue is as a sum of products, where each term is the probability of sending a particular 
signal, xo{q), multiplied by the amount charged in the event of sending it, tQ{q). Thus, our 
program has a quadratic objective function if we treat both the probability and the transfers as 
primal variables. (This issue does not arise in Pricing Mappings, since a buyer of type 6 pays 
the same amount regardless of what signal is sent, hence the variables xg{q) do not appear in 
the objective function.) To make the objective function linear when pricing outputs, we define 
new variables ig{q) = xg{q) ■ tQ{q). Fortunately, this change of variables makes the constraints 
linear as well. 

Full Surplus Extraction. Correlation can be very valuable to the seller. In fact, if the distribution 
exhibits sufficiently complex correlation, the seller might be able to extract full surplus from the 
buyers using a Pricing Outcomes Mechanism. Given a context (n, ^u) we define the full surplus 
as the expected gain the buyer would get by learning the value of a;. In other words, the full 
surplus is Ee[^(^)], where (,{6) = Klmaxa u{6 , uj , a)\6] — max^ E[u(0, w, a)|0]. Clearly no mechanism 
can extract more then the full surplus, and extracting a 1/|B| fraction of it is trivial, even using 
a sealed envelope mechanism, as was observed in Section [3l Now, we show that if is sufficiently 
correlated, then we can extract the full surplus. Our result leverages the ideas developed by Cremer 
and McLean [HllTO] in their work on auctions with correlated bidders, although obviously the setting 
in which we apply these ideas is different from theirs. 

For a joint distribution ^ over x O we define rank(^) as the rank of the \Q\ x |0| matrix 
defined by ^{uj,6). For example, if u and 6 are independent, then rank(/i) = 1. 

Theorem 4.2 // rank(fj,) = \Q\ then the optimal Pricing Outcomes Mechanism extracts the full 
surplus. Moreover, this can be done with a single contract. 

Proof : We define one single contract in the following way. Calculate t G Mp such that: 
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Since rank(;u) = |0|, this system is guaranteed to have a feasible solution. Now, offer the following 
contract to all the buyers: the seller reveals the value of u and requests a payment of t{uj). By the 
definition of t, each buyer is indifferent between buying this contract or not buying anything, so 
the mechanism is voluntary. ■ 

Notice that one can, in the manner of Lemma lC.31 offer the above contract with price (1 — e)-t(a;) 
for each outcome, getting revenue arbitrarily close to the full surplus and making the players strictly 
prefer to buy the contract. 

At this point it is instructive to consider a concrete example for full surplus extraction. 

Example 4.3 Imagine a box that has a locker on it and there are two keys labeled with and 1, 
exactly one of which can open the box. The buyer can choose one key and try it. If he opens the 
box, he gets the object inside. Let the type of the buyer 9 encode his value zq for the object and 
some signal that gives him a hint of which might be the right key. The seller knows exactly what is 
the correct key, and let this be co. How should the seller sell the information to the buyer? 

Consider il. = A = {0, 1}, = [2] and the reward function as u{6,u},a) = zg ■ l{u} = a}, with 
z\ = 3,Z2 = 5. The joint distribution is [fJ-{uj, 9)] = [[j-g []-|]. 

Before participating in the mechanism a buyer of type 9 = 1 has interim belief (0.4, 0.6) on uj, 
so his best action is to pick key 1, getting expected reward 0.6 • 3 = 1.8. If he is able to pick key 
whenever = and key 1 whenever uj = 1, then he always get a reward of 3, so his value for the 
information is ^(1) = 3 — 1.8 = 1.2. Similarly ^(2) = 2. In order to design a contract that extracts 
full surplus, find t{u}) such that: 

0.4-t(0) + 0.6-t(l) =^(1) and 0.6 • t(0) + 0.4 • t(l) = ^(2) 

Solving this system, we get: ti(0) = 3.6,ti(l) = —0.4. This means that if the seller reveals signal 
id = 0, the buyer needs to pay 3.6, if the seller reveals oj = 1, the buyer receives 0.4 from the seller. 
Both buyer types see the full information contract being offered, but because of the correlation of 9 
and UJ, they perceive its expected cost to be different. Player of type 9 perceives the expected cost to 
be E[i(a;)|0] = t{Qi)^{uj = Q\9) + t{l)fi{u} = l\9). This feature of correlation gives the seller a great 
power to do price discrimination. 



4.2 Continuity and Approximation 

One might be tempted to conclude from Theorem [32] that for any distribution fi such that |G| < 
one is able to extract full surplus since all matrices [fJ,{u), 9)] can be approximated arbitrarily closely 
by matrices of full rank. The flaw in this argument is obvious. If one sees // as a G x Q matrix, and 
^ G M® as a vector with the surplus ^{9) in component 9, the payment vector t € in Theorem 
14.21 can be found by solving the linear system n ■ t = ^. If /i is a perturbation of, say, a rank-one 
matrix (corresponding to 9 and uj being independent) then the linear system is very ill-conditioned, 
and therefore the solution t has a very high norm. This causes t to diverge as // becomes closer to 
being independent. To illustrate this point, let us revisit Example 14.31 

Example 4.4 Consider the same reward function as in Example \4.3\ but with a different probability 
distribution: 0, = A = {0,1}, = [2], u{9,uj,a) = zq ■ l{uj = a}, with z\ = 3,^2 = 5, and 
distribution [n{uj,9)] = []:|]. 

The surplus of each buyer is given by ^(1) = 1.5, .^(2) = 2.5. Proceeding as described in Section 
one finds that the optimal mechanism is to offer one single contract that outputs uj exactly and 
costs 1.5. Clearly both types buy this contract and the expected revenue is 1.5. Notice however 
that the expected surplus is ^^{1) + ^^(2) = 2 and Theorem \4-^ guarantees that for a slightly 
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perturbed joint distribution one can extract it entirely. For example, consider fj, = [ 024999 025001 ] • 
Applying the proof of Theorem \4-^ we get the following mechanism extracting revenue 2; offer 
the contract that outputs the full information and if the outcome is uj, a player pays t(uj) where 
[i(0),t(l)] = [-24996,25004]. 

This example highlights two problems with the optimal mechanism. The most obvious is that it 
somehow abuses risk-neutrality. The optimal mechanism produces very large payments which are 
balanced by large rebates. This situation is clearly not desirable in practice. The second problem 
is that the revenue that can be extracted from a certain context might change abruptly whenever 
the context changes slightly. 

It turns out that these discontinuities in the optimal-revenue function are only unidirectional: 
as one varies the context, the revenue can abruptly decrease but it cannot abruptly increase. Fur- 
thermore, for certain restricted classes of mechanisms, the optimal revenue depends continuously 
on the context. In particular, this holds for the first three members of the following sequence of 
progressively more general types of mechanisms^ 

Sealed \ J Pricing 1 ^ f Pricing Outcomes 1 / Pricing 



Envelope J 1 Mappings J 1 t > J 1 Outcomes 

To formalize these continuity assertions, let us fix Q, Q, and A, and regard a context (n, ^) as 
a point in the topological space C = x A{Q x G) equipped with its standard topology. 

The revenue of the optimal mechanism for committed buyers (or, equivalently, the optimal Pricing 
Outcomes Mechanism) will be denoted by R{u,fj,). Similarly, we use Re,Rc,Rp respectively to 
denote the revenue of the optimal Sealed Envelope Mechanism, Pricing Mappings Mechanism, or 
Pricing Outcomes Mechanism with No Positive Transfers. The following theorem formalizes the 
continuity assertions claimed above. Recall that a function / : M'^ — )• M is lower-semicontinuous if 
for all converging sequences Xi — t- x, f{x) < liminf f{xi). 

Theorem 4.5 The function R : C ^ M is lower-semicontinuous. The functions Re,Rc : C — t- M 
are continuous. Let C° be the set of contexts {u,fi) such that ij,{uj,9) > for all lj,9 (henceforth 
called non-degenerated contexts ), the function i?p : C° — )• M is continuous. 

App endix ID . 1 1 breaks the theorem down into pieces and proves each piece. The fact that R is lower- 
semicontinuous is proven by appealing to Lemma IC.31 which shows that for any context (u, ^u), the 
optimal revenue can be approximated arbitrarily closely by mechanisms in which all of the IR and IC 
constraints are slack. Such a mechanism remains valid in a neighborhood of (u, /i), and its revenue 
varies continuously in this neighborhood, hence the optimal revenue throughout a neighborhood of 
{u, n) remains nearly as great as the optimal revenue at {u,n). As a side effect of the method of 
proof, we obtain a robustness result: if the seller believes that the context {u, /x) and designs an 
protocol extracting R{u,n), but the real context may actually be slightly different, the seller can 
design a slightly different protocol extracting revenue at least (1 — e) • R{u, /i) from any context 
{u', n') that is sufficiently close to (u, fi). We note that a proof in the same spirit for the Cremer and 
McLean setting can be found in Robert [19] . Notice that the heart of our proof is the application 
of Lemma IC.3[ In this specific ingredient, our proof is different from that of Robert. 



^ A protocol has no positive transfers if at any transfer node n the seUer never pays the buyer, i.e. t{n) > 0. We 
discuss Pricing Outcomes Mechanism with No Positive Transfers as this mechanism achieves the optimal revenue for 
committed buyers with no positive transfers. (This follows from the proof of Theorem 14. Il l 

Note that we use the term no positive transfers as it is used in literature on auction theory, to exclude mechanisms 
in which the seller pays money to the buyer. We chose to use this standard term although in our setting such a 
payment is represented by a transfer node with negative transfer t{n) < 0. 
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The same argument establishes lower-semicontinuity of the functions Re, Rc, Rp- Their upper- 
semicontinuity fohows from a compactness argument. For each of these classes of mechanisms, the 
optimal revenue can be found by solving a variant of LP2 that reflects the additional constraints 
on the mechanism. Each such variant LP has an unbounded feasible region, and the first step of 
the proof is to identify a compact (i.e. closed and bounded) subset of the feasible region that is 
guaranteed to contain the optimal LP solution, at least for all contexts in a neighborhood of a given 
{u,fi). Taking any sequence of contexts {ui,fii) converging to (tt, /z) and passing to a subsequence 
if necessary, we may assume that their optimal mechanisms Mi converge to a limit Ai belonging 
to the feasible region of the LP. This mechanism M then supplies a lower bound on the optimal 
revenue in context (n, /x) that suffices to prove upper-semicontinuity. 

Approximating Revenue. We have seen that the Sealed Envelope Mechanism achieves at least 
a (l/n)-approximation to the optimal revenue, where n = \Q\, and that this bound cannot be 
improved by more than a constant factor in the worst case. The other two classes of mechanisms 
listed above — Pricing Mappings, and Pricing Outcomes without positive transfers — are substan- 
tially simpler and more natural than Pricing Outcomes in full generality, so it would be desirable 
to approximate the optimal revenue using one of these simpler classes of mechanisms. Unfortu- 
nately, in the worst case, the approximation achieved is rather poor: we prove in Appendix ID. 21 
that there exist contexts {u, fi) such that Rc{u,fi) < Rp{u,fi) < 0(l/n) • R{u,fi). The proof is an 
application of Theorem 14.51 We carefully construct a context with independent u},6 for which the 
full surplus exceeds Rc{u,fj,) by a factor of Q{n). For such a context Pricing Mappings is optimal, 
i.e. Rc{u, fi) = Rp{u, fi) = R{u, /i). Now if we slightly perturb fi to make it into a full-rank matrix. 
Theorem 14.21 tells us that R{u,fi) jumps up by a factor of il{n) to match the full surplus, while 
Theorem 14.51 ensures that Rc{u,iJ,) and Rp{u, fi) can only change by a tiny amount. 

Finally there remains the question of whether Rc achieves a good approximation to Rp in the 
worst case. As it happens, once again the worst-case ratio between these two quantities is Q{n), 
as evidenced by Example ID. 71 in the Appendix. In this example the seller gets from a side channel 
the information about the the buyer's surplus. In a Pricing Outcomes Mechanism (even without 
positive transfers), the seller is capable of leveraging this information, while in a Pricing Mappings 
Mechanism, she is not. 

5 Correlated Signals with Uncommitted Buyers 

In this section we examine mechanisms for uncommitted buyers when signals are correlated, and 
we formulate an intriguing open question related to a surprising failure of the one-round revelation 
property. First, we review what is known about uncommitted buyers from previous sections. Theo- 
rem [3T] says that if a; and 6 are independent, the revenue-optimal mechanism is aPricing Mappings 
Mechanism. The optimal strategy for both committed and uncommitted buyers is the same in such 
a mechanism. However, when cj and 9 are correlated, the revenue-optimal mechanism for committed 
buyers is a Pricing Outcomes Mechanism (Theorem 14. ip . In such a mechanism, the buyer declares 
his type, the seller sends a signal and the buyer pays a certain amount of money that depends on 
the signal sent by the seller. Such a mechanism clearly does not work for uncommitted buyers, who 
can defect after getting the signal but before the payment. 

In section 14.11 we mentioned one way to get around this problem: before executing the Pricing 
Outcomes Mechanism the seller can require the buyer to deposit a large amount of money, then 
the mechanism executes, and after it completes the seller refunds the buyer. This mechanism has 
clear practical drawbacks: in practice a deposit-refund scheme increases the cost of participation as 
it requires the buyer to always be able to make large payments, even in cases in which at the end 



17 



of the protocol the net payment is small, or no payments are made in the execution. The latter 
case is particularly problematic as it is usually costly to establish a payment relationship (e.g. the 
buyer needs to spend time giving his credit card information) so always imposing payments might 
deter some buyers. 

A protocol has no positive transfers if at any transfer node money always goes from the 
buyer to the seller, i.e. t{n) > for every transfer node n. Once we exclude positive transfers the 
problem of designing optimal mechanisms for uncommitted buyers becomes quite challenging. We 
formulate it as the following open problem: 

Open Problem 5.1 Characterize the protocols that extract maximum revenue from uncommitted 
buyers subject to no positive transfers. In particular, is it possible to design an algorithm that 
decides, given any context (u, fi) and target revenue R, whether there exists a protocol with no 
positive transfers that extracts revenue R from uncommitted buyers? 

A natural first attempt to address this problem would be to prove, in the manner of Theorem 
13.11 and Theorem 14.11 that for this setting one can extract optimal revenue using a One-Round 
Revelation Mechanism. We show that this approach fails. 

Theorem 5.2 (Failure of the One-round Revelation Property) There exists a context (u, fi) 
with correlated to and 6 for which some generic protocol with no positive transfers extracts strictly 
more revenue from uncommitted buyers than any One-round Revelation Mechanism with no positive 
transfers. 

In order to prove this theorem, we observe that if it is possible to extract revenue R from an 
uncommitted buyer using a One-round Revelation Mechanism, then it is possible to extract the 
same revenue using a Pricing Mappings Mechanism. This follows easily from the fact that if a 
transfer node n is the last node before the leaf in a path of the protocol tree, an uncommitted buyer 
will always defect before this node if t{n) > 0. 

Then in Example 15.31 we present a context for which an interactive protocol with no positive 
transfers extracts strictly more revenue than any Pricing Mappings Mechanism. 

Example 5.3 We present a context {u, fi) for which a mechanism where the seller interacts with 
the buyer twice (producing a protocol tree of height A) extracts strictly more revenue than any direct 
revelation mechanism. 

Let Vl = {0,1}, O = {0,1} and the distribution ^ = [02 03]- Let A = {0,1} and define the 
utility such that u{6,uj,0) = oj (for a = and u{9,uj,l) = 1 — 9a; (for a = 1). As usual, since 
Q = {0, 1} we represent the posterior probability by one real number q G [0, 1], the probability of the 
event uj = \. The first part of FigurelM depicts the utility as a function of the posterior q, the utility 
is represented by functions vg : [0, 1] — )• M. In this particular case vo{-) = ^^i(') = '^'(■)- 

It is simple to see that the optimal Pricing Mappings Mechanism offers a single contract, pricing 
the full information (value of oj) at 0.4, and getting revenue of 0.4. 

We next present a protocol with no positive transfers that extracts strictly more revenue than 
the optimal Pricing Mappings Mechanism from uncommitted buyers. The protocol is represented 
by a tree of height 4 depicted in the second part of Figure \M It consists of two transfer nodes with 
amounts ti = 0.533 and t2 = 0.8. For the seller nodes, the transition probabilities are as follows: 

• node si; the seller outputs ii whenever uj = and outputs £2 whenever oj = 1. 

• node S3; the seller outputs £3 whenever uj = and outputs ^4 whenever uj = 1. 
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Figure 2: Example 15. 3t a protocol for uncommitted buyers generating strictly more revenue then 
any direct-revelation-meclianism. 

• node S2: the seller moves either to node t2 or to node £5 according to the following probabil- 
ities: P(4|a; = 0) = 0, P(4|a; = 1) = 5/6 and clearly P(t2|w) = 1 - 

Now, we claim that the optimal strategy for an uncommitted buyer is to play left (to node ti) 
whenever 9 = and play right (to node S2) whenever 6=1, and then follow the protocol (make 
the transfers when asked) without defecting. We prove this in Claim lKl\ in Appendix \M Given 
these strategies we calculate the expected revenue of the protocol, which is: ^{9 = 0) • 0.533 + ^{9 = 
1) • 0.8 • ¥{t2\e = 1) = 0.4665 > 0.4 

We have shown above that generic protocols can extract strictly more revenue than the One- 
round Revelation Mechanism when buyers are uncommitted and no positive transfers are allowed. 
One might wonder if such interactive mechanisms can extract as much revenue from uncommitted 
buyers as can be extracted from committed buyers. We show that the answer is no, as for the 
setting of Example 15.31 there is a gap between the two. 

Theorem 5.4 There exists a context with correlated to and 9 for which the optimal revenue that 
can be extracted from uncommitted buyers using a protocol with no positive transfers is strictly less 
than the optimal revenue that can be extracted from committed buyers. 

We prove the theorem in Appendix[El For context (u, /i), the optimal revenue that can be extracted 
from committed buyers using protocols that has no positive transfers is exactly the revenue Rp{u, /x) 
that can be extracted by Pricing Outcomes with No Positive Transfers Mechanism. To prove the 
claim we show in Appendix |E] that for the setting of Example 15.31 for some (5 > it is impossible 
to extract revenue of Rp{u, fi) — 5 from uncommitted buyers using protocols that have no positive 
transfers. 

The intuition behind the proof of Theorem 15.41 is the following: in this context, it is possible to 
extract the full surplus from committed buyers. In order to extract close to this much revenue from 
uncommitted buyers, the mechanism must be offering an option that results with some posterior 
very close to the full information. Now we can show that since a buyer of = 1 is paying at most 
his entire surplus, there is a deviation for a buyer of type 9 = that guarantees him almost full 
information for a price considerably below his surplus. 

6 Open Problems 

We believe the design of mechanisms for selling information is an area full of exciting possibilities. 
In particular, there might be potential connections with information theory and cryptography, that 
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could be discovered and formalized. In this section we describe some interesting open directions, 
listing them from the more precise to the more vague problems: 

1. Mechanisms for Uncommitted Buyers: The main open problem left by our paper is the 
problem of designing revenue optimal protocols for uncommitted buyers, when no positive 
transfers are allowed (Open Problem 15. ip . Theorem 15.21 reveals that multiple rounds of par- 
tial information disclosure (interleaved by payment to the seller) are sometimes necessary to 
achieve optimal revenue if the buyer is allowed to abort his interaction with the seller pre- 
maturely. Solving this open problem would probably require the development of new tools 
for directly bounding the revenue of interactive mechanisms — in sharp contrast to most 
optimal mechanisms in the literature which satisfy some version of the One-round Revelation 
Property. 

Possibly, tools from dynamic mechanism design [U [T7] and from cheap talk [5] might be 
useful in dealing with this question, since both involve repeated interactions between parties 
before an outcome is effectively implemented. Nevertheless, our setting differs from them in 
fundamental ways: the reason interaction is necessary here is different from that in dynamic 
mechanism design. In our problem, information is just revealed by nature in the beginning 
of time. No exogenous signals are revealed during the interaction between seller and buyer. 
This fact brings our model closer to cheap talk, yet there is a substantial difference as in our 
setting talking is not "cheap" in the sense that payments are required so that the conversation 
continues. 

2. Continuous Type Spaces and Structured Contexts: Our results fix a finite discrete type 
space and use tools from Linear and Convex Programming to design optimal mechanisms. Is 
it possible to obtain a general theory of "Selling Information" where the signals from nature 
come from arbitrary spaces? Eso and Szentes [Tl] are able to deal with real-valued signals 
but they severely restrict the utility function of the agents. 

Is there a natural yet less restrictive condition on the context that makes the problem tractable 
for more generic spaces? Can one come up with a general condition that would be still be 
sufficient for the optimal mechanism to have explicit and natural representation (similar to 
Myerson's mechanism), rather than being a solution to some mathematical program? 

3. Multiple buyers: In our work, we consider only two agents: a seller and a buyer. In 
the context of information advertisement, there are usually many sellers (many information- 
providing agencies) and many buyers (advertisers). Those buyers do not simply solve a 
decision problem once they acquire the information, but play a game among each other. 

A natural next step is to consider a variant of our model where there is a single seller but 
multiple buyers, who play a game after acquiring information. One needs to be careful when 
defining such problem since we must be sure that the game has an unique equilibrium or that 
for each possible outcome of the selling information phase, the Bayesian game played in the 
second phase has a focal equilibrium on which we can concentrate when reasoning about the 
first phase. Along those lines is the work of Eso and Szentes [12] and the recent paper by 
Kempe, Tardos and Syrgkanis |20] . 

4. Coupling Goods and Information: As mentioned earlier, there are many situations where 
goods and information are coupled together (for example, the restaurant example in the 
introduction). What is the correct model in which to analyze such situations? 
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5. Dynamic selling of information: We analyzed a single interaction between the buyer 
and the seller. Alternatively one could consider an ongoing (possibly interactive) relation 
between those two parties, e.g. when the buyer is interested in information regarding multiple 
impressions. Can the seller benefit from this and extract strictly more revenue from the whole 
process than she would if running the optimal auction for each query individually? It is known 
that for traditional goods, selling bundles might generate strictly more revenue than selling 
goods individually. What is the form of the analogous results for information? 

6. Computationally bounded agents: We assumed that the seller sends messages to the 
buyer coming from a certain distribution and the buyer uses Bayes' rule to learn about the 
state of the world from those messages. We do not impose computational constraints on the 
agents. If we were to assume computationally-constrained buyers, the seller could potentially 
take advantage of cryptographic primitives, for example sending an encrypted piece of infor- 
mation to the buyer and later selling the decryption key. How would such capabilities affect 
the results? Would the seller be able to exploit them in a profitable way? 
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A Proofs omitted from Section [2] 

Proof of Theorem 12.61 : The idea of the proof is simple: we add in the root a buyer node where 
the buyer is asked to report his type 6. For each branch, there is a copy of the original protocol 
where the seller simulates the behavior of the buyer. 

Formally, we fix a context and a consider a protocol represented by a tree, distributions {^J^ji^gn 
for each seller node n and transfers t{n) on each transfer node. Now, consider a set of moves for 
the buyer {4>n}ee0 each buyer and transfer node. We can assume that for any buyer node n, 
4>f^ outputs _L with zero probability, since any other strategy is weakly dominated by a strategy 
in which the buyer does not defect at the buyer node and instead defect at the first transfer node 
encountered. Let T represent this protocol. 

Now, define the protocol Tg as the same tree with all buyer nodes n substituted by seller nodes 
where the seller moves according to 0^ for all c<j G il. Now, the tree Tg consists only of transfer and 
seller nodes. 

Now, design a new protocol with a buyer node in the root with |0| outgoing edges, one cor- 
responding to each type. Attach to the 0-branch a copy of Tg. Now, it is simple to see that an 
optimal committed (uncommitted) strategy is to report the true type and then follow the protocol 
(defecting in the nodes of Tg corresponding to the nodes in T where he defected in the original 
protocol). If any deviation is profitable now, the corresponding deviation would be profitable in the 
original protocol. Also, the revenue of the truthful strategy in the new protocol is clearly the same 
as the revenue of the original strategy in the original protocol. ■ 
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B Proofs omitted from Section [3] 



Proof of Theorem 13. II : Fix the context, consider a voluntary protocol and an optimal strategy 
(j) for a committed buyer achieving revenue R. We show that we can reduce it to a protocol of the 
specified form, achieving the same revenue for uncommitted buyers. Let Z{uj, 6) be the distribution 
over leaves obtained when we start from the root and use i/'^, cj)^ to move down the tree. Also, let 
L be the set of leaves of the tree. 

In the root of the reduced protocol, we place a buyer node with |G| edges out of it. This node 
corresponds to the buyer being asked to report his type. The children of the root are transfer nodes, 
each corresponding to a distinct type 0. The amount at such a transfer node corresponds to the 
expected payment of type 9 in the mechanism, i.e., E(^[r(Z(a;, ^))]. 

Now, consider the node n who is the child of the transfer-node in the 0-branch. Add a copy of 
L as its children and for each have the seller use the distribution Z{ijj, 9). 

Now, we need to check that the new protocol is indeed equivalent to the original one. Consider 
the uncommitted strategy where the buyers report their true types in the first node and then pay 
according to the transfer node. This strategy generates the same utility and payments as the ones 
in the original protocol, so one only needs to check that it is indeed an optimal strategy for the 
buyer. 

It suffices to see that a buyer of type 9 would not rather declare 9' instead or defect in the 
middle of the reduced protocol. Defecting in the middle is clearly not beneficial, since the protocol 
is voluntary and the buyer does not learn anything until the last move. Also, if declaring 9' were 
profitable for a buyer of type 9, he would prefer to play according to (fP in the original protocol. 
This happens for two reasons. First, it generates the same distribution over leaves given oj as 
declaring 9' in the new protocol would. So the value of the deviations is the same. Second, it 
generates the same expected payment as declaring 9' in the new protocol would. This depends 
crucially on independence. A buyer playing cj)^ experiences the same expected payment regardless 
of his type, since u is drawn from the same distribution (independent of 9) and we thus visit the 
same nodes with the same probabilities in all cases. ■ 

Notice that the last step in the proof clearly does not hold if w, 9 are correlated, since a buyer of 
type 9 could change his declaration to 9' , but he can not prevent w to be sampled with probability 

Proof of Observation 13.31 : Simply notice that the utilities of each player for each contract 
in the original and new menu are the same. ■ 

Proof of Observation [3741 : Let qi G A(r2) be the posterior associated with i, i.e. qi{oj) = 
¥{uj\Y = i) and let x{qi) = P(y = i). Then: 

<<ii) ■ q^{oo) = Y.ny = i)- n^\y = i) = k^) 

Conversely, given (Q, x) satisfying (F) we can define a variable Y taking values in [k] and set its 
joint distribution with oj to be P(ci;, Y = i) = x[qi) ■ qi{u}). ■ 

Proof of Lemma 13.51 : The function u defines piecewise linear functions vg : A(r2) — t- M. 
Each of them induces a partition of A(J7) in |^| polytopes in which vg{-) is linear. One can combine 
the |0| partitions, by taking the coarser partition of A(0) that is simultaneously a subpartition of 
the one induced by vq for all 9. This way we obtain a finite partition such that for all its regions 
all the Vg functions are linear. 

Now, let Q* be the set of vertices of the regions in this partition. Given any posterior q £ A(i7), 
if it is in region R of the partition, one can write q = jiqi where q^ S Q* are vertices of region 
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R and ^,j7i = 1,7.4 > 0. Now, given a primal solution to LPi, if xg{q) > and q ^ Q*, simply 
increase xg{qi) by ^iXg{q) and decrease xg{q) to zero. By the linearity oi vg{-),vgt{-) in R, it is clear 
that the resulting solution is still feasible and has the same objective. By repeating this process as 
many times as needed, one ends with a solution where the support is in Q* . ■ 

Proof of Lemma 13.61 : Given a convex programing problem min/(x) s.t. Ax < b, in order 
to show that it can be solved exactly in polynomial time we need to show three things (see section 
5.3 of p]): 

• that the function / can be computed efficiently for each point xq in the domain. 

• that for each point xq in the domain it is possible to calculate a subgradient v £ df{xQ). A 
subgradient is a vector v £ M" such that f{x) — /{xq) > v^{x — xq) 

• that there is an optimal solution that can be expressed with polynomially-many bits. 

Point 1 above is trivial. For point number 2 we use the linearity of the subgradient: if u S dg{xQ) 
and w € dh{xQ) then v + w £ d{g + h){xQ). And since the subgradient of linear functions is trivial, 
we now need to show how to compute the subgradient of vgi{-). Notice that vgi{-) is the maximum 
of linear functions. We use the fact that if g{x) = maxj gj{x), g{xQ) = gi{xQ) and Vi G dgi{xQ), then 
Vi £ dg{xQ), since: 

g{x) - g{xo) = g{x) - gi{xo) > gi{x) - gi{xo) > vj{x - xq) 

For point 3, Lemma 13.51 savs that LPi can be written with finitely many variables. Therefore, 
the dual LP (DLPi) can be written with finitely many constraints. Now, the optimizer of the dual 
can be found by taking O(|0p) constraints, making them tight and solving the resulting linear 
system. This solution has polynomially many bits. 

Given an optimal solution q £ A{Q), for each S let a{9) be a solution to max^ Ei^^gu(w, 9, a). 
There is a polytope V C A{i}) that consists of all posteriors q' such that actions {a{9))g^Q remain 
optimal when the posterior is q' . The objective function of the convex program is linear when 
restricted to V, so the set of optimal solutions includes at least one extreme point of V, and such 
an extreme point can be found in polynomial time. Recalling the construction of the set Q* in 
the proof of Lemma 13.51 we see that all of the extreme points of V are elements of Q*, so we have 
established that our convex program has an optimal solution q* that belongs to Q* , and that we 
can find such a q* in polynomial time, as claimed in the lemma. ■ 

The following example, discussed in Section [3l reveals that the Sealed Envelope Mechanism does 
not extract more than f^(y^) of the revenue of the optimal mechanism. 

Example B.l Consider Q = {0,1}, = [n] and ^{(jj,9) = ^_^„1 . Now, for each we can 
represent u{9,uj, a) as a piecewise-linear function vg{-) on [0,1] where 

vg{q) = max[n(^, l,a) • q + u{9, 0, a) • (1 — q)] 

a 

Let vg{-) be the function interpolating the following three points: (0,0), (1 — ^,0), (1,2^), where T 
is some fixed large number. Now, the value of a buyer of type 9 for the envelope is 2^ and occurs 
with probability fi{9) = 0(2~^), so any price p will only be able to extract 0(1) revenue. 
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Now, we show a Pricing Mappings Mechanism that can extract revenue. Consider the 

menu where the price of contract 9 is tg = {1 — ^)2^ — e and outputs the posterior 1 — w.p. ^ 
and the posterior jw+i w-P- \- This is the same as outputting a variable Yg with joint distribution: 

l/(2r^+i) 1/(2(1 -T^'+i))" 
1/(2(1 -T'^+i)) l/(2r^+i) 

It is simple to check that this is a valid menu and that it generated revenue ^{{l — ^)'n). 

Proof of Theorem 13.71 : Given any context, fix an optimal solution {xg{q),tg}0 for LPi. 
We can assume = [n] and that the solution is such that ti > t2 ^ ^3 ^ • • • ^ ^n- Let Qq C A{Q) 
be the posteriors in the support of xg. We can restrict our attention to LPi with variables xg{q) 
for q £ UeQe- 

For any given 9 € [n] we show that we can substitute xg{-) by some x'q{-) with support at most 
9 + m — 1. In order to do so fix all tgi and xgi{-) for all 9' ^ 9. Now find Xg((7),ig solving the 
following system: 

maxtg s.t. 

'^ve{q)x'g{q) - t'g > may:{vg{p),max'^ve{q)xg'{q) -tg/}, (IRg) + {ICg^e',9' ^ 9) 

q&Q geQ 

Y,ye'{q)xg'{q) - tg, > Y^MlHiq) - tg, W < 9, {ICg,,g,9' < 9) 

^ x'siq) ■ q{uj) = fi{oj), Vui en (Fg) 
x'g{q) > 0, yqeQ 

The first constraint represents the IR^ combined with ICgfii for all 9' ^ 9. Since all xgi[-),tgi 
are fixed, the rhs of the inequality is constant. The second family of constraints refer to the ICgifi 
constraints for 9' < 9. 

The system of inequalities has 9 + m constraints, so a basic feasible solution has support at most 
9 + m, which corresponds to t'g and x'q{-) such that ||a;g(-)||o < 9 + m — \. Consider the solution we 
get from substituting xg[-) by x'q[-). The only constraints in LPi which can be violated are: 

^Vgi{q)xgi{q) - tg, > '^Vg{q)xg{q) - tg 

q q 

for some 9' > 9. If this happens for some 9', substitute {xgi,tgt) by {x'Q,tg). It is simple to see that 
the solution obtained is feasible and that the revenue did not decrease. 
Finally, observe that the theorem follows as 

||a^6»||o < ^^("^ — 1 + 9) = mn + 

e e 

We show in Example IB. 21 that the quadratic dependence in |0| is necessary if one wants to 
obtain the optimal revenue. ■ 
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Example B.2 Consider Q = {0, 1}, Q = [n] and as in Example \B. 1\ represent u by piecewise-linear 
convex functions vg : [0, 1] — )• M. For some small enough parameter 5 > 0, define vg as the function 
that interpolates points 

(0, 0), (1 - 56, 0), (1 - 6{9 - I), 5''-^), (1 - 5(9 - 2),6^'~'), (1 - ,5, 6'), (1, 5'-') 

For example, vi is the function interpolating (0,0), (1—5,0), (1,1). Functionv2 interpolates (0,0), (1 — 
25, 0), (1 - <5, 5^), (1, S) and so on. 

For this example we represent all the posteriors q £ A{Q) by a real number q{l), the posterior 
probability that a; = 1. 

Now, define such that fi{u! = 0) = //(w = 1) = ^. Before we define the distribution over Q we 
note two things about the primal LP: 

1. there is an optimal solution to LPi where all posteriors are in Q = {0, 1 — n5, 1 — (n — 
1)5, ... ,1 — 5,1} by the argument in the proof of the Interesting Posteriors Lemma (Lemma 

2. we can write the primal only with the variables XQ{q) for q £ Q. This generates a finite 
LP. Further notice that the polytope that defines the feasible set is define by vq(-) and the 
distribution fi{uj), both already defined. The distribution ^{9) just appears in the objective 
function. 

Since the feasible region is already defined and it is a polytope (since by restricting to Q we have 
a finite number of constraints), let V be the set of all vertices of this polytope. Define V C M_|_ as 
the set of all t such that there is {xgi ,tgr)Qi^Q G V and 9 such that t = tg. In other words, it is the 
set of all prices appearing in the vertices of the feasible regions. Given that, pick some e such that: 

e < — • ■ —. — 

n max^ ^,^y \t — v\ 

and define ^(9) = ■ ^(Iz^n-j ■ Now the context is define, we can analyze the optimal menu of 
contracts for this context. We will show that the optimal menu is unique and ll^^ello = Qin^)- 

We claim that for this value ofe, the optimal solution to the primal LP can be found by optimizing 
ti over the feasible set, fixing it, then optimizing for t2 (having xi , ti fixed), and so on. The definition 
of e guarantees that no possible gain in tg + 1, . . . ,tn can justify a decrease in tg. 

For example, for 6 = 1, if we want to find maxti with respect to the feasible set of the primal 
LP, the unique solution is to take ti to be the surplus ^[ui(0) + fi(l)] — vi{^) = ^fi(l), since 
vg{0) = vg{^) = by construction, and xi(0) = xi(l) = |. Notice that buyer of type 1 gets no 
utility from this contract (the seller is able to extract full surplus). Now, after fixing {xi,ti) we need 
to guarantee that for all the other contracts buyer with 9 = 1 is not able to extract any utility from 
them. 

We claim that there is an unique optimal solution and the support of xg is 0,1 — (9 — 1)5, 1 — 
{9 — 2)5, . . . , 1. Moreover, in the optimal solution, the utility a buyer of type 9 extracts from his own 
contract is zero, i.e., ^gXg(q)vg{q) — tg = vg{p). We show this by induction on 9: in order to find 
the optimal {xg,tg) one needs to solve the following program (we are using the fact that the players 
9' < 9 get zero utility from their own contracts): 
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max tg s.t. 

^ve{q)xe{q)-t8>0,, {IRe) 

Y,ve'{q)xe{q)-te<Q, W<e, {ICe',e'<e) 

^((7, 1 -(/)• = (1/2,1/2), {Fe) 
q&Q 

xe{q) > 0, 
Substituting the values ofvg{-), we get: 

e-i 

Y,^'^'^'xe{l-j5)>te 

S''^'-'xe{l - j6) < tg,W = 1, ...,e- 1 

j=0 

Now we claim that in the solution maximizing tg all the inequalities above must be tight. Since 
it is a triangular system, it guarantee that the support of xg needs to be of size 6 + 1. 

Consider an optimal {xg,tg) maximizing tg and suppose that one of the constraints above is not 
tight. Clearly if the IR constraint is not tight, we can simply increase tg, which is an absurd since 
the solution is optimal. 

Now, suppose that one of the IC constraints are not tight. Then let 6' be the largest 6' such that 
ICgi is not tight. Now, it must be the case that xg{l — 9' 6) > 0. In order to see that, consider two 
cases (i) 6' < 9 — 1 : this would imply that constraint ICg/^i is also slack, violating the maximality 
of 6' and (ii) 6' = 9 — 1 : this would cause the solution to be infeasible due to a conflict between 
ICg^i and IR. 

Civen that, one can construct a new solution by decreasing xg{l — 9' 5) by a small amount p 
and increasing xg{l — (9' — 1)5) by the amount 6 p. This keeps IR and IC constraints still valid, 
but violates ^q^giq,! — q) ■ xg{q) = (1/2,1/2). It is simple to see one can increase xg{0) and 
xg{l — {9 — 1)6) by suitable values to restore this constraint. By doing so, we do not violate any of 
the IC constraints and we make the IR constraint a little bit more slack, which allows us to increase 
tg . Since it was optimal from beginning, then all IC constraints must be tight. 

Now, since all IC constraints and IR constraints are tight, and together they form a triangular 
system, it is easy to see the solution must have support 9 + 1. Now, summing for all 9, we get 
X^e 11^6 llo = X^e 1 + ^ exactly matching the bound in theorem \3.7\ for m = 2. 

C Proofs omitted from Section 14.11 

Proof : [of Theorem 14. 1| Using the same notation of Theorem 13.11 we can mimic that proof, but 
whenever constructing the new protocol, we add a seher node just after the buyer node with a 
transition to a copy of L using Z{uj, 9) in the 0-branch. Now, for each child of the seller node, if it 
corresponds to a leaf i in the original protocol, add a transfer node with amount r(^). 

Again, the buyer committed strategy where each buyer reports his type truthfully generates the 
same outcomes and payments as the original protocol. We only need to argue that it is optimal. 
The argument is very similar to the one in Theorem 13. H however the inversion of the buyer and 
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seller nodes allows us to prove point 2 without independence. We notice that if a buyer of type 
9 plays using 9' he has the same distribution over leaves as he would get if he played (p^ in the 
original protocol - by the new construction, it also means the same distribution over transfers. ■ 

Computing the optimal Pricing Outcomes Mechanism. Consider a buyer of type 9 that 
buys a contract {Y,t). Let Y be represented by a family E ^{S) for each oj ^ Q.. If a signal 
s G 5 is observed by a buyer with type 9 and also by an outside observer, the two observers have 
different beliefs on a; — the buyer's belief is ^x{-\9) while the outside observer's is /x(-) — so Bayes' 
rule dictates their respective posteriors as follows. 

So, up to rescaling there is a very simple linear transformation that relates the posteriors of the 
buyer of type 9 for a signal and the posterior of the outside observer. Let Dq be an |r2| x diagonal 
matrix with ^{9\u}) in the diagonal. Then, we can write: 

- {BU) 



Given the 1-1 mapping between posteriors from the perspective of buyer of type 9 and posteriors 
with respect to an outside observer, from this point on, whenever referring to posteriors, we always 
refer to posteriors with respect to the outside observer and apply the suitable transformation when 
necessary. 

Representation of contracts. Analogously to what was done in the independent case, we repre- 
sent y by a distribution over (outside observer) posteriors. We can use Observation 13.31 to see that 
it is possible to assume w.l.o.g. that the posteriors associated with each s G 5 are different. So, we 
can represent as a distribution over the set of posteriors A(il). More formally, we represent Y by 
X : A(r2) — )• M_|_ with finite support satisfying the feasibility condition in Observation 13.41 Similarly 
t can be represented as t : A(r2) — t- M, simply by associating each signal with its posterior. Given 
that, we can represent the valuation that a buyer of type 9 has for contract (Y^t) as: 

Ey[maxE[n(a;,0,a)|0,y]] = J^FIF = s\9) • = ^ • ^^W) = 

where the first line uses the fact that vg is homogeneous of degree 1, and the transition from the 
first to the second line makes use of Bayes' rule in the form P(s|0)/P(0|s) = P(s)/P(0). The buyer's 
expected payment for this contract is given by: 

Ey[t(y = s)\9] = Y.t{s) ■ ^-^^''^'^^ = ^ Y.t{s)x{q^)Y,q'{^MO\^) = 

s&S ^ ^ ^ ^ s£S >^ 

where 1 is the vector in with all 1 coordinates. 
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Linear Programming Formulation: Now, in order to optimize the revenue from the protocol we 
need to design a menu of contracts that is valid, in the sense that player 9 indeed chooses contract 
{Yq, to) and this choice is voluntary. One can define this in a similar manner to Definition 13. 2l making 
the appropriate changes. Since the definition and follow up discussion very closely mirrors the one 
in the independent case, we proceed to designing the LP. As in the independent case, there is an 
"interesting posteriors lemma" that allows us to limit our attention to contracts whose posterior 
vectors belong to a predefined finite set Q* . This time the set Q* depends not only on u but on /i, 
since the operator Dq depends on ^i. 

Lemma C.l (Interesting Posteriors for correlated signals) Given u: QyiO-xA^M. and 
fi G A(© X Q), there is a finite set Q* C A(r2) such that for all fi, the maximum revenue that can 
be extracted by any protocol can also be extracted by a protocol that is limited to use posteriors in 
Q* . Moreover, all the elements q & Q* can be represented with polynomially many bits. 

Proof : [sketch] The proof exactly parallels the proof of Lemma \3.5\ except that now we must use 
the linear transformation Dg to translate every buyer's piecewise-linear valuation function vg into 
the outside observer's frame of reference. In other words, we partition A(r2) into polyhedral pieces 
on which each of the function q i— t- vg(Dgq) is linear, and let Q* be the set of vertices of the regions 
in this partition. ■ 

Let us continue deriving the LP formulation of the revenue maximization problem. The variables 
in the LP are {xg{q),tg{q)) for all 9 £ @,q £ Q*. We define tg{q) = tg{q) • xg{q). We write a linear 
program that expresses maximization of revenue subject to the analogues of the usual IR, IC and 
feasibility constraints for the correlated case. 

LP2 : max ^ "^{1^ Dgq) ■ ig s.t. 
9 q 

Vg{Dgq)xg{q) - {l'Dgq)ig{q) > Vg{Dgp), ^9, (IRg) 

q 

Y,MDeq)xg{q) - {l'Dgq)tg{q) > Y,MDeq)xg>{q) - {l' Dgq)ig,{q), W / 9, {ICg,g,) 
q q 

"^xgiq) ■ q{uj) = fx{uj), y9,uj, (Fg) 

q 

xg{q)>0, yq,9 

Remark C.2 Before solving LP2 to find the optimal mechanism, it is important to remark that 
the variables of the LP correspond to the posteriors as seen an outside observer, i.e., someone that 
knows /i but only observes the signal that the seller outputs and not the value of 9. Equation (BU) 
shows how to calculate the posteriors of the buyer given the posterior of the outside observer. It 
is instructive to consider the feasibility constraint (Fg) in LP2 when viewed from the perspective 
of the buyer. First, we can re-write (Fg) as: J2s^i^) ' I'^i^) = rnultiplying by l-i{9\uj) and 

substituting by the definition of qg given by the Bayesian update equation (BU) we have: 

Y,m-Qii^)-ms) = fi{io,9) 

s 

Now, by manipulating the probabilities, we get: 

Y,ns\e) ■ qlico) = Kio\9) 
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which shows that the feasibility constraints from the perspective of the buyers is simply the feasibility 
constraints from the perspective of the observer transformed by the Bayesian update (BU) operator. 

Solving LP2 can be done using exactly the same techniques used in the independent case: 
passing to the dual and using convex programming to design a separation oracle. We omit the 
details on how to solve it, since it essentially mirrors the independent case. 

Once we have a solution xg{q),tg{q) with support on Q C Q*, we want to recover a solution 
{xg,tg). Clearly one can obtain by taking ie{q) /xo{q) whenever xe{q) is positive. What if it is 
equal to zero? In order to solve this problem we take the following steps: (1) we show that there is 
one feasible solution with revenue at least (1 — e) of the optimal, where all the IC and IR constraints 
are slack (2) then we increase XQ{q) slightly for all g G Q and renormalize so that feasibility holds 
and (3) we safely calculate tg^q) = iQ{q)/xQ{q), since now we are sure that xg{q) is positive. The 
following lemmas show how to perform steps (1) and (2): 

Lemma C.3 (Strict Preferences) Given an optimal solution {xg^tg) to LP2, then there is a 
solution {xq, ig) whose objective is at least a (1 — e)-fraction of the optimal, such that all IC and IR 
constraints are slack, except possibly ICg^i when (xg,*^) = i.e., when the contracts for 6 

and 6' are exactly the same. 

Proof : First, suppose a certain IC constraint is tight, say: 

^Ve{Dgq)xe{q) - {l^Dgq)te{q) = ^Vg{Dgq)xe.{q) - il*Dgq)ig:{q). 
1 g 

Then it must be the case that {l^Dgq)tg{q) > {l^Dgq)tg'{q). Otherwise one could replace {xg,tg) 
with {xgi,tgi) as buyer 0's contract and improve revenue while still getting a feasible solution. We 
can assume w.l.o.g. that either {l^Dgq)tg{q) > {l'^Dgq)tQi{q) or {xg,tg) = {xgi,tg^). Otherwise, if 
{l^Dgq)ig{q) = {l^Dgq)ig/{q), one can simply replace {xg^ig) with {xg',ig'). 

Now, consider the menu where the contract for a buyer of type 6 is (rc^, t^) = (xg, (1 — e)tg). We 
claim that for sufficiently small e, this is feasible and all constraints are slack. 

Any constraint that held strictly is still strict for sufficiently small e. Any tight IR constraint 
becomes clearly strict for any e > 0; in addition, no IR constraint becomes violated. Now for the 
IC constraints that were previously tight and {xg,tg) / {xg',tg/), we know that (l*Dgq)tg{q) > 
{1* Dgq)tg'{q), hence the inequality becomes strict. ■ 

Lemma C.4 Given an optimal solution {xg,tg) to LP2 where all the IC and IR constraints are 
slack except possibly ICg^i when {xg,tg) = {xgi^tgi), there is another solution {xg,t'g) with the same 
objective such that xg{q) > for all q £ Q. 

Proof : Recah that p G is the prior on w, i.e., p{uj) = ^i{oj). We can assume wlog that 
p{oj) > 0,Va; S fi. (Otherwise, we can simply restrict our attention to a subset of i}). Therefore, p 
is in the relative interior of A(0). For all q G A{Q) there is r{q) G A(r2) and 7(g) G [0, 1] such that: 

P = 7(9) • 9 + (1 - 7(g)) • r{q) 

Now, consider t'g = tg for all 9 and a;g(g) = (1 — 5)xg{q). Now, for all g G Q increase x'g(q) by 
77^i{q)^ and x'g{r{q)) by 7^(1 — 7('i'))^- If ^ is sufficiently small, the IC and IR constraints are not 

violated. For the IC constraints that held with equality and {xg,tg) = {xgi,tgi), they still hold since 
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= It is trivial to check that by construction, the feasibihty constraints still hold. 

■ 

Also, we would like to observe that as observed in Theorem 13.71 for the Pricing Mappings 
Mechanism, the optimal protocol in this case can be represented by a tree of 0{n^ + mn) size. 
Given the structural similarities between LPi and LP2, the proof follows by a simple adaptation 
of the proof of Theorem 13.71 Formally: 

Observation C.5 Let n = |0| and m = The program LP2 has a solution where \\xq\\q < 

0{m + n) for all 6. 

D Proofs omitted from Section 14.21 

This appendix contains the full details of the material sketched in Section 14. 2[ 
D.l Semicontinuity Proofs 

Theorem D.l The revenue function R : C ^ M. is lower-semicontinuos. 

Proof : Consider a sequence of contexts {ui,fj,i) such that {ui,fj,i) — > {u,fi), then consider a 
revenue optimal mechanism for the context (u, /u). Invoking Lemma IC.31 we know that for every 
e > 0, there is a mechanism for context (u, /i) generating revenue at least (1 — e) • R{u, fx) such that 
all IR constraints are slack and all IC constraints for non-identical-contracts are slack. Given that 
the inequalities are strict and that the constraints are continuous in the context, then for sufficiently 
large i, the mechanism is feasible for {ui,^i), so: lim ini R{ui, Hi) > R{u,fi) — e for all e > 0. ■ 

We note that a proof in the same spirit for the Cremer and McLean setting can be found in 
Robert [19] . Notice that the heart of the proof is the application of Lemma IC.31 In this specific 
ingredient, our proof is different then the one of Robert. 

The proof of Theorem [DT] can be interpreted as a robustness result: if the seller believes that the 
context («,//) and designs an protocol extracting R{u,n), but the real context is actually slightly 
different {u' , /i'), the seller can design a protocol extracting revenue (1 — e) • R{u, ^) from any context 
{u',ij,') that is sufficiently close to {u,n). 

This does not mean, however, that the seller is close to the optimal revenue, since it might be 
the case that the a context (n', /i') allows the seller to extract a lot more revenue then (u, /i) even 
though the two contexts are close. 

Two Continuous Mechanisms. In what follows we present two mechanisms that fix the two 
highlighted problems of the Pricing Outcomes Mechanism: they are continuous and they do not 
involve transfers from the seller to the buyers. Then we compare their revenue-extration power 
with the optimal mechanism. For independent {uj,9), they extract optimal revenue, but there are 
contexts with correlated {u},9) where they extract only a r2(|B|~^)-fraction of the optimal revenue. 

1. Pricing Outcomes with No Positive Transfers: The auctioneer offers to the buyers a menu 
of contracts {{Yg,te)}e, where Yg is a random variable taking values in Sg and tg : S ^ M+. The 
only change with respect to the optimal mechanism is that we require tg{s) > 0,Vs S Sg. To find 
the optimal mechanism of this type, one can simply solve LP2 with the additional constraints that 
ie{q)>0,y9,q. 
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2. Pricing Mappings: The auctioneer offers to the buyers a menu of contracts {(Yg,tg)}g where 
Ig is a random variable taking values in Sg and tg G is a fixed value that the buyer needs to pay 
in order to get the signal Yg. The optimal mechanism of this type can be found using a simplified 
version of LP2 : simply substitute all the occurrences of J2q(.^^ ^9'l)'^9' il) ' ^9'- 

We presented four types of mechanisms which are increasing in complexity and revenue-extracting 
power. The following diagram expresses which types are a special case of which: 

r Scaled 1 1^ f Pricing 1 (- f Pricing Outcomes 1 ^ f Pricing 1 
[ Envelope J ~ \ Mappings J~\ t >0 /~l Outcomes J 

For a given context {u, n) G C, we define Re{u,n), Rc{u,fj,) and Rp{u, fi) as the optimal revenue 
that can be obtained using a Sealed Envelope Mechanism, Pricing Mappings Mechanism and Pricing 
Outcomes With No Positive Transfers Mechanism, respectively. We know that: 

Reiu,fJ.) < Rc{u,n) < Rp{u,fl) < R{u,fi) 
And by Theorem 13. II we know that for independent n, i.e, fi{uj, 6) = n{uj) • fJ.{0), we have: 

Reiu,l-i-) < Rciu,f^) = Rpiu,f^) = R{u,ii) 

First we will show that Rp, Rc and Re are continuous in C and then we analyze the gap in 
their revenue extraction power. By a similar argument then the one in Theorem ID. H it is simple to 
see that they are lower-semicontinuous. In order to prove continuity, we only need to show upper- 
semicontinuity. We say that a function / : R*"' — )• M is upper-semicontinuous if for for all converging 
sequences Xi — )• x, f{x) > limsup /(xj). A function is continuous if it is both lower-semicontinuous 
and upper-semicontinuous. 

Lemma D.2 The revenue function i?c : C — )■ M is upper-semicontinuous. 

Proof : Consider a context {u, fj.) £ C and (uj, fii) — {u, fi). Passing to a subsequence if necessary 
we can assume that Rc{ui,fj,i) converges to the limsup of the original sequence. Now, we need to 
prove that Rc{u, fi) > lim Rc{ui, ^li). We do so in two steps: (1) we show that the optimal Pricing 
Mappings Mechanism can be represented by an element in a compact set; (2) we use compactness 
arguments to take the limit of such mechanism obtaining a mechanism for context {u,fi). 

Compact representation: The revenue Rc{ui,fii) is the solution of a variant the LP2 described 
above. As the original LP2, it has O(|0p) constraints and therefore has an optimal solution that 
has support 0(|Gp). Now, the optimal mechanism for context {ui,fii) can be represented by a 
price vector G and random variables with O(|0p) support. This means that each of 
those random variables can be represented as \Q\ distributions {ipgY G A([O(|0p)]). So, the set 
of variables {Yg}g^Q admits a representation as an element of a compact set. Now, we need to 
argue that the set of vectors are bounded and therefore can be also represented as elements of a 
compact set. 

As for the price vector t*, notice that < ^* for any individually rational mechanism, where ^* 
is the surplus vector for context {ui, Hi). If ^ is the surplus vector for context (u, fi) then for i > zq, 
then 11^* — ^lloo < 1- For the i > zq, therefore f is in the compact JlelO'^l^) + 

Limit mechanism: Now, for each context {ui,^i), the optimal mechanism can be represented 
a point Mi G K where K \s a, compact set. Therefore, passing to a subsequence if necessary, we 
can assume that A^j — t- A4. Now, since IR and IC properties are preserved in the limit (since 
they are continuous properties), is a valid Pricing Mappings Mechanism for context (n, /u) and 
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its revenue is at least the limit of the revenue extracted by on {ui,fii). Therefore, Rc{u, fi) > 
lim snp Rc{ui, fii). ■ 

An even simpler compactness argument can be formulated for i?e- For Rp we need to add a mild 
non-degeneracy assumptions: we say that a context {u, fi) is non-degenerated if fi{u},6) > 0,\/uj,9. 

Lemma D.3 The revenue function Rp : C is upper- semicontinuous in the neighborhood of all 
non- degenerated context. 

Proof : As in the proof of Lemma [D. 21 the main goal is to show that the set of optimal mechanisms 
for contexts {ui,Hi) are members of a compact set and then proceed as in the second part of the 
proof of that lemma. For the first part, we proceed for the Yg as in Lemma ID.2i But we need a 
different argument to show that the transfers in the optimal mechanism are bounded. 

Given a non-degenerated context, let 5 = ^ min^^e fJ'{G\uj), then if {ui,^i) — )• (u, /i), then for 
i > io, then fii{9\uj) > 5 for all u, 9. Increasing if necessary, we can also assume that ||^* — ^||oo < 1 
for i > io- So, if tg{q) is the payment of buyer 9 for posterior q in the optimal mechanism for context 
{ui,Hi), then ig{q) = Xg{q) ■ tg{q) is in the compact set HJO' ^i^i^) + ^'^^ individually rational 
mechanism, since: 

^ve{Deq)xe{q) - {I'DeqMq) > ve{Dep) 

implies that: 

Y,vliDeq)xl{q)-vl{Dep) 

. Q 

> ^{I'Deq) ■ iiiq) > {I'Deq) ■ il{q) > 6 ■ i^q) 

Q 

Notice that inequalities crucially depend on tg{q) being non-negative. ■ 
The following theorem summarizes the previous discussion: 

Theorem D.4 The revenue functions Rc,Re : C — t- R are continuous and the revenue function 
i?p : C — 7- R is continuous around any non- degenerated context. 



m + 1 > eio) > 



D.2 Approximating Revenue 

We know from Example IB. II that the Sealed Envelope Mechanism performs very poorly in terms 
of revenue when compared to the optimal mechanism, even if the w, ^-signals are independent. A 
natural question is how do the other mechanisms perform in comparison to the optimal mechanism. 
The Pricing Mappings Mechanism is simple and allows for a much more natural implementation 
then the Pricing Outcomes. Unfortunately, as we show in this section, both the Pricing Mappings 
Mechanism and the Pricing Outcomes with no positive transfers Mechanism can perform very poorly 
compared to the Pricing Outcomes Mechanism for certain contexts: 

Lemma D.5 The revenue extracted by a Pricing Mappings Mechanism and by a Pricing Outcomes 
with no positive transfers Mechanism is at least a l/\Q\-fraction of the optimal mechanism: 

Rc{u,i2) > j^^R{u,fi), Rp{u, fi) > -^^R{u, fi) 

and those bounds are tight up to constant factors. 



33 



The bound is trivial and it follows directly from the fact that the optimal revenue is bounded by 
the surplus, and extracting full surplus from a single type is trivial. Now, we show that the bound 
is tight using the continuity of i?c(') and Rp{-)- 

Example D.6 In this example we show that there exist a context {u,fj,) such that Rc{u, fi) < 
0{l/\e\)R{u,^i) andRp{u,n) < 0{l/\e\)R{u, n). 

For n = Q = [n], we present a non- degenerated context (u, fi) where {u},6) are independent and 
the optimal revenue is a 1/n-fraction of the full surplus I-l{0)£,{9) . For this context: 



R{u, f,) = Rp{u, ii) = R,{u, /i) = - V mm 



Now, if one sees ^ as a n x n-matrix and rj is any n x n-matrix of full rank with g vi^i ^) — Oi 
then for any small t > 0, fi + trj represents a probability distribution over Q x 0, close to fj, but with 
rank(fj, + tr]) = n, therefore, the optimal mechanism extracts full revenue (Theorem \4-^ - In other 
words: 

R{u,fi + tr]) = Y,KOMO) 
e 

where £,t is the surplus for context {u,fi + trj). By continuity: 

lim i?e(n, fi+t7]) = Rc{u, /i) = - V mm 

t^Q+ n ^ — ' 

e 

while 

lun R{u,^^+tv) = Y,mm 

~^ 9 

showing that the bound it tight. So, all we need to so is to exhibit a context where the optimal 
revenue is a f3/n fraction of the full surplus for some constant (3. 

Let $7 = G = A = [n], u{6,u},a) = vg ■ l{a = u}, vg = 2^, fi{u}) = ^ and = jz^^' 
surplus is given by ^(0) = 2^(1 — so Y2g l^i^)^i^) ~ 1-2- " • revenue optimal mechanism for 
this context is a Pricing Mappings Mechanism. 

Using the construction in the Interesting Posteriors Lemma (Lemma \3. 5|) we can search for 
a solution where the only posteriors in the support are of the form qx{^) = for u £ X and 
qx{'^) = otherwise, where X is any subset of [n]. Therefore we can solve the primal LP only 
considering the xg{qx) variables, i.e., we need to solve: 



2" 
"2- 



max^"^ :^^tg s.t. 



+ , ( \Ve ^Vg 

tg+ y Xg{qx)-r-;^ > — , 

XC[n] ' ' 

E/ \ Vg ■^-^ , s Vg 

xe{qx)-rY\ ^ + xg^iqxjjj^^, 



^Xg{qx) = l,xg{qx) > 

e 

Notice one can define xg = X]xc[n] \^\^o{qx) cind re-write the LP as: 
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e 

-tg + xgve > ve/n, 

-tg + XgV0 > -te' + Xo'Ve, 

1/n <xe<l 

Now it suffices to show that the solution of the above LP is bounded by a constant as n grows. 
This can be seen very easily by showing a feasible dual. We have that: 

n-l 

-ti + xivi + > 2 [-tg+i + xe+ive+1 +te- xeV0+i\ > 

L n J ^-^ 

e=i 

substituting vg = 2^ and simplifying the expression, we get: 

y 2~^t0 < 2£„ - — - - < 2 

One might also wonder what is the relation between Rc and Rp. Clearly Rc{u, fi) > j^Rp{u, /i). 
The following simple example shows that this bound is tight. 

Example D.7 In this example we show that there exists a context {u,fi) such that Rc{u, fi) < 
0{l/\@\)Rp{u,fi). 

We consider in this example the case where the seller learns exactly the type of the buyer. We 
encode this information in his signal oj. R will have two components: uji which is the information 
the buyer is interested and 002 which tells the seller the type 9 of the buyer. A Pricing Outcomes 
Mechanism will be able to explore this information in order to extract full surplus from the buyer, 
but a pricing contracts cannot. 

Let VLq = {0, 1} and Oq = [n] and consider a probability distribution jiQ on JIq x Oq where 
/io(woi ^0) = 2i-9o ■ 1-2-" • ^ow, define utilities of the buyer as functions ofuo{ujo, 9o, a). As usual, 
we define as a piecewise-linear function Vq^q : [0, 1] — )• M. Consider the function interpolating 
(0, 2^0), 0), (1, 2^'^). Now, let's define = Go and = JIq x ©O; and if 00 = {oji,uj2), then we 
define a probability distribution fi on Q x Q as n{u:,9) = fio{uJi,9) and u{uj,9,a) = UQ{uji,9,a). 

By using a Pricing Mappings Mechanism, the seller is not able to use the information in UJ2, 
and therefore he can only extract 0(1) revenue. If the seller is able to price signals, he can design 
the contract 9 such that the price of (0,9) and (1,9) is 2^ — e and the price of the signals {ojo, 9') is 
00 for 9' ^ 9. Therefore, buyers will he forced to buy the contract for their correct type. 

The implementation above casts the mechanism in the general "Pricing Outcomes" form given 
by Theorem \4-.l\ but a more natural implementation would be for the seller to learn the 9 = uj2 and 
then announce price 2^ — e for the full information signal. 

E Proofs omitted from Section [5] 

Claim E.l In the interactive protocol presented in Example \5.3\ the optimal strategy for an uncom- 
mitted buyer is to play left (to node ti) whenever 9 = and play right (to node S2) whenever 9 = 1, 
and then follow the protocol (make the transfers when asked) without defecting. 

Proof : It is simple to see that a buyer of type 9 = 1 will not play left at the root, since his 
surplus for the information is only ^(0) = 0.4 < 0.533. A buyer of type 9 = playing left and not 
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defecting gets the full information and pays 0.533, so his utility is 1 — 0.533 = 0.467. information 
and pays 0.533, so his utility is 1 — 0.533 = 0.467. To complete the proof we still need to show that 
a buyer of type 9 = would not prefer to play right and that the buyer of type 9 = 1 will follow 
the entire protocol. 

In the node S2, upon receiving £5 both buyer types update their beliefs to a; = 1. Now, upon re- 
ceiving t2, they update their beliefs about co to: ¥{uj\9, ^2) = • '^{t2\^)/ J^u' if^i^'l^) ' IP(*2|'^'))- 
Doing the calculations, we get that P(a; = 1\9 = 0,t2) = 0.1 and ¥{u; = 1\9 = l,t2) = 0.2. By the 
shape of the V0{-) function, no player has increase in utility by this signal. Hence, a buyer of type 
9 = would not play right and then defect in t2- At this point (node t2) the surplus for knowing 
the value of u exactly is 0.9 for 9 = and 0.8 for 9 = 1. So, it is clear that type 9 = 1 would pay 
the amount of 0.8 to get the signal revealed. 

Now, we are left with the question of whether the player of type 9 = would not prefer to play 
right. In both cases (left and right), he gets the full information about the value of uj. Now, we 
need to compare which one has the smallest expected price in his perspective. The price of the left 
path is 0.533 and the left path has price 0.8 • F{t2\9 = 0) = | • 0.8 = 0.5333... ■ 

E.l Proof of Theorem 15.41 

To prove Theorem 15.41 we consider the setting of Example 15. 3i 

We first observe that it is possible to extract revenue 0.5 from committed buyers using a Pricing 
Outcomes with No Positive Transfers Mechanism. The interim belief of buyer of type signal is 
IJ,{lo = 1\9 = 0) = 0.4, so his surplus for the information is .^(0) = 0.6, while for buyer of type 1 his 
interim belief is 0.6 and his surplus is ^(1) = 0.4. By Theorem 14.21 we can extract the full surplus 
^^(0) + ^'^(1) = 0.5 from committed buyers. This can be done by a Pricing Outcomes with No 
Positive Transfers Mechanism in which the seller reveals the value of u and charges t{Lj), where 
t{0) = 1 and t{l) = 0. 

The next lemma completes the proof of the theorem. 

Lemma E.2 For the context (n, /u) with correlated co and 9 that is defined in Example 15.31 it hold 
that there exists some 5 > such that the revenue of any protocol for uncommitted buyers that has 
no positive transfers is at most 0.5 — 5. 

Proof : Fix 6 = 10-^° < lO'^ = e. Let's assume that there is a protocol with no positive transfers 
that is able to extract revenue 0.5 — 6 from uncommitted buyers. We can assume by Theorem 12.61 
that the protocol has the following format: the root is a buyer node with two children and all the 
other nodes of the tree are either seller or transfer nodes. We can also assume that the optimal 
strategy for the 9 = buyer is to take the left path and follow the protocol until then end and that 
the optimal strategy for the 9 = 1 buyer is to take the right path and follow the protocol until the 
end. 

Now, since the surplus of the types are 0.6 and 0.4 respectively, the protocol should extract at 
least 0.6 — 25 from buyer of type and at least 0.4 — 25 from buyer of type 1. We want to claim 
that in such a setting, a buyer of type ^ = can benefit from taking the right path (i.e., behaving 
like a buyer of type ^ = 1) paying strictly less then 0.6 — 25. This generates a contradiction. 

Now, we start analyzing the change in utility for = if he does as ^ = 1, i.e. takes the right 
path. Let L be the set of leaves that are in the right side of the root. And let ■ip'^ G ^(L) for be 
the distributions induced on the leaves for each ui G {0, 1}. Now, if the output of the protocol is 
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leaf I the buyer of type Q updates his behef to: 



In vector notation, the posterior of a buyer of type Q upon observing a leaf i of the right path is: 



i, where 



and q 



= o\e) 



He obtains this posterior with probability X^^^/ ^(i^'l^)V''^ (^) = l^A^q^. Therefore, his total value 
(not counting payments) for taking the right path is: '^li'^^ q^) ■v{^^-g) . The previous expression 
has a slightly notation abuse: we defined v : [0, 1] — t- R+. We overload the function v with 
V : A(0) M+ such that for we define v{q) = v{q{l)). 

Fact E.3 IfY^g{l^A^q^)-v{q^''^) > 1-26, then it must he the case thatY,f^{l^A^q^)-v{q^'^) > 1-36-6. 
In other words, if a buyer of type 9 = 1 can extract value very close to 1 from learning the leaves 
reached by the protocol, then a buyer of type 6 = should also be able to extract values very close 
to 1 from it. 

Proof : First let Li = {£; ^26 < q^'^{l) < 1 - ^26}. First, we claim that EfeLi l*^^*?^ < V26- 
If not, then: 

1 - 25 < • v{q^'^) < (lU^gl) • v{q^'^) + ^ (I'A^q^) ■ v{q^'^) < 

< (1 - V26) {l^A^q^) + 1-Y1 (1*^^^^) = 1-V26Y (t^A^q^) < 

< 1 - V26 • V26 = 1-26 

using the fact that v{q) < max{(7, 1 — q} and the fact that 'Ylit^ ~ [^1] '^'^ definition of . 
Now, in order to get an upper bound for X^^l^*^^?*^) ' v{q^'^) we focus on £ ^ Li. For those vectors, 
we still need to bound f (g"'^). We consider two cases. 



case 1: q^'\l) < 6' := y/26. 



6' > q^\l) 



AW{1) 



A- 



A^O'ZHO) + Ai,q\l) Ai,OA + Ai,Q.& A{, 



00 3(1 - 6') ^ 1 



26' 



26' 



Now, calculating g'^'^(l), we get: 



A{,q\l) 



A{^<dA 



< 



0.4 



4ogO(0)+4igO(i) A^p0.6 + ^fi0.4 - (l/2,5')0.6 + 0.4 

Therefore: u(g°'^) > 1 - 9(2^) = 1-18-5' 
case 2: gi'^(l) > 1 - 5' = 1 - y/26. 



< 26' 



■I \ „i, 



6' > 



(0) 



A 



A^ogi(O) + Ai.q^l) Ai,OA + Ai,0.6 ^ 



11 2(1 - 6') ^ 1 



00 



35' 



35' 
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Now, calculating g'''^(0), we get: 

^ ' Ai^q^iO) + A{^qO{l) A^o0.6 + ^fi0.4 - 0.6 + 0.4(1/35') " 

Therefore: ?;(g°'^) > 1 - (5<5') 

The important fact is that v{q^'^) > 1 — ISV^ for each £ ^ Li. Now, we are ready to bound the 
value that type 6 = gets from the right path: 

(1*AV) • v{q^'^) > (I'A^q^) ■ v{q^'^) > (1 - V26) ■ (1 - 18^26) > 1 - 36 • 5 



Now, for the next step, we bound the total expected payment of buyer ^ = if he decides to 
deviate to the right path and follow the protocol until the end. In order to do so, let the set of 
transfer nodes in the left path and for each i £ N, let ti be the amount associated with it. Also, let 
P(i|a;) be the probability that a buyer following the right branch of the tree reaches i condioned on 
u. Define also the matrix: 

{i\uj = 0) 
F{i\uj = 1) 

Now, remember we had fixed in the beginning a constant e ^ 6 but still e ^ 1. Now the 
expected payment of player of type 6 in the right branch of the protocol is given by: 



A' 



We know that player of type 6 = 1 pays at least 0.4 — 26, so Ylii^^^^Q^) " — 0-^ ~ 2(5. Now, we 
show that the payment of buyer = if he switches to the right branch of the protocol (i.e. moves 
as a buyer of type 6 = 1) is 1 — 0(e), giving him utility 0(e) ^ 26 for doing so. We formalize this 
below. 

Fact E.4 If the payment of player of type 6 = 1 is X]j(l*^*'?^) - ti < 0.4, then the expected payment 
"^■(l^A^q^) ■ ti of the player of type 6 = after moving is at most 0.6 — e(0.4 — e). 

Proof : We will decompose the sum '^■{l^A'^q^)-ti in two parts and bound each of them separately. 
First, we divide the transfer nodes in two classes: 

Ni = {ie N; l^A'q^ < (f - e)l*A'q^} 

N2 = {i£ N; l^A'q^ > (| - e)l^A'q^} 

First note that for all i (even i G iV2), l^A^q^ < ^l^Aiq^ simply because q^ < ^q^ component- 
wise. Now, we can write: 

YC^'A'q^) • t» < (I - e) • I'A'q^ + 1 " ^'^'l^ (1) 

If we show that most of ^^(l*^*g^) • ti is concentrated on A'^i we are done. 

Now, consider some transfer node i G and recall that q^ = [qI] and q^ = [gg]. Therefore: 



iGN2^ 0.6^^0 + 0.4^1i > (I - 6)(0.4yl^o + 0.6A\,) ^ ^ < (2) 



41i - 4e 

^00 



38 



Observe also that once in a node i £ N2, a buyer of type 6 = 1 is wilhng to follow the protocol 
until the end, the expected amount he is paying by following this node, must be at least as big as 
his surplus for the full information at that node. Formally: 

1 - v( , , , ) >ti+ y ti- —r^ (3 



where N{i) is the set of transfer nodes that have i as an ancestor (excluding i itself). In order to 
justify the ti 
is given by: 



justify the term ^t^^' iiotice that if i is an ancestor of i then the probability of reaching j given i 



' ' ¥{i\e) ¥{i\e) 

Now, given i £ N2, since the matrix A^ is of the form [^^^'^ 0{e)]' posterior probability of each 
player in that node is 0(e), so v{q) = w^q where w = Therefore, for i G N2, we can re-write 

equation ([3]) as: 

ti ■ (I'A'q^) + ^ tj- (I'A^q^) < w'A'q^ 
jeN{i) 

where w = 1 — w = [q]. Now, let A^f be the nodes in N2 that have no ancestor in A^2- Therefore: 



Now, we know by equation ([2]) that: 



U-{l'A'q^)+ tyil'A^q^) 

jeN(i) 



< w^A'q'^ < 9 • ^ 4i • 0.6 



E ^ E ^ -^00 < e E n^l^ = 0) < e 

where we are using the definition that is the probability that the protocol reached node i given 
a; = and we are using the fact that X^j^^s P(i|u; = 0) < 1 since the protocol reaching i G N2 are 
disjoint events, since for all paths in the protocol-tree there is at most one node in Therefore: 
X^jgATj ii ■ (1*^*'?"'^) < 9e, which allows us to finish the bound in equation ([1]): 



Taking Fact IE. 31 and Fact IE. 41 together, it is clear that there cannot be a protocol extracting 
0.5 — 6 from uncommitted buyers without transfers from the seller to the buyer. Summarizing the 
argument, if there were such protocol, then the buyer of type 9 = 1 would be faced with the option 
of getting almost full information (i.e. getting a signal that would give him at least 0.4 — 26 value) 
and would pay at most 0.4. If a buyer of type 9 = pretends to be of type 9 = 1, he guarantees 
by fact Fact IE. 31 a contract that has almost full information (gives him value at least 1 — 365) and 
charges him at most 0.6 — 0.4e + (Fact IE.4p . This contradicts the fact that we can extract at 
least 0.6 — 25 revenue from type ^ = 0. ■ 
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